E6 binding proteins

ABSTRACT

E6-BP polypeptides, nucleic acids encoding E6-BP polypeptides, and uses thereof.

GOVERNMENT FUNDING

Work described herein was supported in part by funding from the National Institute of Health. The United States Government has certain rights in the invention.

RELATED APPLICATIONS

This application is a continuation-in-part of U.S. Ser. No. 08/273,059, entitled "E6 Binding Proteins" which was filed on Jul. 8, 1994, now abandoned, the specification of which are incorporated by reference herein.

BACKGROUND OF THE INVENTION

Papillomaviruses (PV) have been linked to widespread, serious human diseases, especially carcinomas of the genital and oral mucosa. It is estimated that there are currently somewhere in the neighborhood of tens of millions of women who suffer from human papilloma virus (HPV) infection of the genital tract. Many of these women eventually develop cancer of the cervix. For example, it has been estimated that about twenty percent (20%) of all cancer deaths in women worldwide are from cancers which are associated with HPV. It has also been estimated that 90% of all cervical cancer is linked to HPV.

Papillomaviruses induce benign, dysplastic and malignant hyperproliferations of skin or mucosal epithelium (see, for example, Mansur and Androphy, (1993) Biochim Biophys Acta 1155:323-345; Pfister (1984) Rev. Physiol. Biochem. Pharmacol. 99:111-181; and Broker et al. (1986) Cancer Cells 4:17-36, for reviews of the molecular, cellular, and clinical aspects of the papillomaviruses). Almost 70 human papillomavirus types have been identified, and different papillomavirus types are known to cause distinct diseases, Pfister, (1987) Adv. Cancer Res., 48:113-147, Syrjanen, (1984) Obstet. Gynecol. Survey 39:252-265. Human papillomaviruses (HPVs) are a heterogeneous group of DNA tumor viruses associated with hyperplastic (warts, condylomata), pre-malignant and malignant lesions (carcinomas) of squamous epithelium. For example, HPV types 1 and 2 cause common warts, and types 6 and 11 cause warts of the external genitalia, anus and cervix. HPV, types 16, 18, 31 and 33 have been isolated from the majority of cervical cancers with HPV-16 present in about 50 percent of all cervical cancers. These HPV's are referred to as "high risk". While HPV 6 and 11 are the most common isolates for cervical warts, these infections rarely progress to invasive cancer, and therefore these HPV's are referred to as "low risk".

Studies of viral gene expression in carcinomas suggest the importance of two HPV encoded proteins, E6 and E7, in malignant development and these proteins have been shown to encode transforming and immortalizing activities. The two proteins show some functional resemblance to the transforming proteins of other small DNA tumor viruses such as adenovirus and SV40. E7 shares functional and structural features with the adenovirus E1A proteins. Like Ad E1A and the large T proteins of the polyomaviruses, E7 can complex pRB. Likewise, the E6 oncoprotein encoded by the "high risk" HPV's can form a complex with p53. In vitro, E6 promotes the degradation of p53 and this degradation involves the ubiquitin-dependent protease system. The selective degradation of cellular negative regulatory proteins such as p53 regulatory functions provides an explanation of the action for dominant acting oncoproteins. The relevance of the inactivation of the normal functions of pRB and p53 in human cervical carcinogenesis has recently been demonstrated by the analysis of these two genes and their products in a series of HPV-positive and HPV-negative cell lines. These studies support the notion that the inactivation of the normal functions of the tumor suppressor proteins pRB and p53 are important steps in human cervical carcinogenesis, either by mutation or through complex formation with HPV E6 and E7 oncoproteins.

SUMMARY OF THE INVENTION

The present invention relates to the discovery in eukaryotic cells, particularly human cells, of novel protein-protein interactions between the papillomavirus transforming protein E6 and certain cellular proteins, referred to hereinafter as "E6-binding proteins" or "E6-BP".

In general, the invention features a E6-BP^(SD-7) polypeptide, preferably a substantially pure preparation of an E6-BP^(SD-7) polypeptide, or a recombinant E6-BP^(SD-7) polypeptide. In preferred embodiments: the polypeptide has biological activity, e.g., it specifically binds a papillomavirus E6 protein; the polypeptide has an amino acid sequence at least 60%, 70%, 80%, 90% or 95% homologous to the amino acid sequence in SEQ ID No: 8; the polypeptide has an amino acid sequence essentially the same as the amino acid sequence in SEQ ID No: 8; the polypeptide is at least 5, 10, 20, 25, 30, 40, 50, 100, or 150 amino acids in length; the polypeptide comprises at least 5, preferably at least 10, more preferably at least 20, 25, 30, 40, more preferably at least 50, 100, or 150 contiguous amino acids from SEQ ID No: 8; the E6-BP^(SD-7) polypeptide is either, an agonist or an antagonist of a biological activity of an E6-BP, e.g., of the regulation of cell proliferation; the polypeptide includes an E6-binding motif corresponding to Ala 194-Asp 218 of SEQ. ID No. 8.

In preferred embodiments the invention includes E6-binding proteins with antagonistic activity, and which preferably are capable of: suppressing tumor growth, e.g. in a tumor cell in which endogenous E6-BP is misexpressed; suppressing growth of papillomavirus-infected cells, e.g. HPV-infected cells; blocking or inducing apoptosis; inhibiting growth of a papillomavirus-infected cell, e.g. an HPV-infected cell, e.g. a high-risk HPV infected cell, e.g. and HPV-16, -18, -31, or -33 infected cell, e.g. a bovine papillomavirus (BPV)-infected cell; inhibiting infection of a cell by a papillomavirus, e.g. an HPV, e.g. a high-risk HPV, e.g. and HPV-16, -18, -31, or -33, e.g. a bovine papillomavirus (BPV); inhibiting transformation of a cell by a papillomavirus, e.g. an HPV, e.g. a high-risk HPV, e.g. and HPV-16, -18, -31, or -33, e.g. a bovine papillomavirus; or inhibiting immortalization of a cell, e.g. a human cell, by a papillomavirus, e.g. an HPV, e.g. a high-risk HPV, e.g. and HPV-16, -18, -31, or -33, e.g. a bovine papillomavirus. In preferred embodiments, the antagonist is a fragment of the full-length SD-7 protein, which fragment, for example, retains the ability to bind E6 and competitively inhibits binding of the full-length SD-7 protein. For example, fragments containing the E6-binding motif corresponding to about Ala 194-Asp 218 can be provided as antagonists of the full-length protein.

In a preferred embodiment, a peptide having at least one biological activity of the subject E6-BP^(SD-7) polypeptide may differ in amino acid sequence from the sequence in SEQ ID No: 8, but such differences result in a modified protein which functions in the same or similar manner as the native E6-binding protein or which has the same or similar characteristics of the native E6-binding protein.

In yet other preferred embodiments, E6-binding protein is a recombinant fusion protein which includes a second polypeptide portion, e.g., a second polypeptide having an amino acid sequence unrelated to a protein represented by one of SEQ ID Nos: 8-14, e.g. the second polypeptide portion is glutathione-S-transferase, e.g. the second polypeptide portion is a DNA binding domain, e.g. the second polypeptide portion is a polymerase activating domain, e.g. the fusion protein is functional in a two-hybrid assay.

In preferred embodiments of a protein homologous to SEQ ID No: 8, the protein has a molecular weight of approximately 50 kilodaltons, e.g. in the range of 45-55 kD, e.g. in the range of 48-52 kD.

In preferred embodiments: the peptide includes at least 1, 2, 3, or 5, and preferably 10, 20, and 30, amino acid residues from residues 1-133 of Sequence ID No: 8.

Yet another aspect of the present invention concerns an immunogen comprising an E6-BP polypeptide in an immunogenic preparation, the immunogen being capable of eliciting an immune response specific for said E6-BP polypeptide; e.g. a humoral response, e.g. an antibody response; e.g. a cellular response. In preferred embodiments, the immunogen comprising an antigenic determinant, e.g. a unique determinant, from a protein represented by SEQ ID No: 8.

A still further aspect of the present invention features an antibody preparation specifically reactive with an epitope of the E6-BP immunogen.

In another aspect, the invention features a E6-BP^(SD-8) polypeptide, preferably a substantially pure preparation of an E6-BP^(SD-8) polypeptide, or a recombinant E6-BP^(SD-8) polypeptide. In preferred embodiments: the polypeptide has biological activity, e.g., it specifically binds a papillomavirus E6 protein; the polypeptide has an amino acid sequence at least 60%, 80%, 90% or 95% homologous to the amino acid sequence in SEQ ID No: 9; the polypeptide has an amino acid sequence essentially the same as the amino acid sequence in SEQ ID No: 9; the polypeptide is at least 5, 10, 20, 50, 100, or 150 amino acids in length; the polypeptide comprises at least 5, preferably at least 10, more preferably at least 20, more preferably at least 50, 100, or 150 contiguous amino acids from SEQ ID No: 9; the E6-BP^(SD-8) polypeptide is either, an agonist or an antagonist of a biological activity of an E6-PB, e.g., of the regulation of cell proliferation.

In preferred embodiments the invention includes E6-binding proteins with antagonistic activity, and which preferably are capable of: suppressing tumor growth, e.g. in a tumor cell in which endogenous E6-BP is misexpressed; suppressing growth of papillomavirus-infected cells, e.g. HPV-infected cells; blocking or inducing apoptosis; inhibiting growth of a papillomavirus-infected cell, e.g. an HPV-infected cell, e.g. a high-risk HPV infected cell, e.g. and HPV-16, -18, -31, or -33 infected cell, e.g. a bovine papillomavirus (BPV)-infected cell; inhibiting infection of a cell by a papillomavirus, e.g. an HPV, e.g. a high-risk HPV, e.g. and HPV-16, -18, -31, or -33, e.g. a bovine papillomavirus (BPV); inhibiting transformation of a cell by a papillomavirus, e.g. an HPV, e.g. a high-risk HPV, e.g. and HPV-16, -18, -31, or -33, e.g. a bovine papillomavirus; or inhibiting immortalization of a cell, e.g. a human cell, by a papillomavirus, e.g. an HPV, e.g. a high-risk HPV, e.g. and HPV-16, -18, -31, or -33, e.g. a bovine papillomavirus.

In a preferred embodiment, a peptide having at least one biological activity of the subject E6-BP^(SD-) 8 polypeptide may differ in amino acid sequence from the sequence in SEQ ID No: 9, but such differences result in a modified protein which functions in the same or similar manner as the native E6-binding protein or which has the same or similar characteristics of the native E6-binding protein.

In yet other preferred embodiments, E6-binding protein is a recombinant fusion protein which includes a second polypeptide portion, e.g., a second polypeptide having an amino acid sequence unrelated to a protein represented by one of SEQ ID Nos: 8-14, e.g. the second polypeptide portion is glutathione-S-transferase, e.g. the second polypeptide portion is a DNA binding domain, e.g. the second polypeptide portion is a polymerase activating domain, e.g. the fusion protein is functional in a two-hybrid assay.

Yet another aspect of the present invention concerns an immunogen comprising an E6-BP polypeptide in an immunogenic preparation, the immunogen being capable of eliciting an immune response specific for said E6-BP polypeptide; e.g. a humoral response, e.g. an antibody response; e.g. a cellular response. In preferred embodiments, the immunogen comprising an antigenic determinant, e.g. a unique determinant, from a protein represented by SEQ ID No: 9.

A still further aspect of the present invention features an antibody preparation specifically reactive with an epitope of the E6-BP immunogen.

In another aspect, the invention features a E6-BP^(SD-12) polypeptide, preferably a substantially pure preparation of an E6-BP^(SD-12) polypeptide, or a recombinant E6-BP^(SD-12) polypeptide. In preferred embodiments: the polypeptide has biological activity, e.g., it specifically binds a papillomavirus E6 protein; the polypeptide has an amino acid sequence at least 60%, 80%, 90% or 95% homologous to the amino acid sequence in SEQ ID No: 10; the polypeptide has an amino acid sequence essentially the same as the amino acid sequence in SEQ ID No: 10; the polypeptide is at least 5, 10, 20, 50, 100, or 150 amino acids in length; the polypeptide comprises at least 5, preferably at least 10, more preferably at least 20, more preferably at least 50, 100, or 150 contiguous amino acids from SEQ ID No: 10; the E6-BP^(SD-12) polypeptide is either, an agonist or an antagonist of a biological activity of an E6-PB, e.g., of the regulation of cell proliferation.

In preferred embodiments the invention includes E6-binding proteins with antagonistic activity, and which preferably are capable of: suppressing tumor growth, e.g. in a tumor cell in which endogenous E6-BP is misexpressed; suppressing growth of papillomavirus-infected cells, e.g. HPV-infected cells; blocking or inducing apoptosis; inhibiting growth of a papillomavirus-infected cell, e.g. an HPV-infected cell, e.g. a high-risk HPV infected cell, e.g. and HPV-16, -18, -31, or -33 infected cell, e.g. a bovine papillomavirus (BPV)-infected cell; inhibiting infection of a cell by a papillomavirus, e.g. an HPV, e.g. a high-risk HPV, e.g. and HPV-16, -18, -31, or -33, e.g. a bovine papillomavirus (BPV); inhibiting transformation of a cell by a papillomavirus, e.g. an HPV, e.g. a high-risk HPV, e.g. and HPV-16, -18, -31, or -33, e.g. a bovine papillomavirus; or inhibiting immortalization of a cell, e.g. a human cell, by a papillomavirus, e.g. an HPV, e.g. a high-risk HPV, e.g. and HPV-16, -18, -31, or -33, e.g. a bovine papillomavirus.

In a preferred embodiment, a peptide having at least one biological activity of the subject E6-BP^(SD-12) polypeptide may differ in amino acid sequence from the sequence in SEQ ID No: 10, but such differences result in a modified protein which functions in the same or similar manner as the native E6-binding protein or which has the same or similar characteristics of the native E6-binding protein.

In yet other preferred embodiments, E6-binding protein is a recombinant fusion protein which includes a second polypeptide portion, e.g., a second polypeptide having an amino acid sequence unrelated to a protein represented by one of SEQ ID Nos: 8-14, e.g. the second polypeptide portion is glutathione-S-transferase, e.g. the second polypeptide portion is a DNA binding domain, e.g. the second polypeptide portion is a polymerase activating domain, e.g. the fusion protein is functional in a two-hybrid assay.

Yet another aspect of the present invention concerns an immunogen comprising an E6-BP polypeptide in an immunogenic preparation, the immunogen being capable of eliciting an immune response specific for said E6-BP polypeptide; e.g. a humoral response, e.g. an antibody response; e.g. a cellular response. In preferred embodiments, the immunogen comprising an antigenic determinant, e.g. a unique determinant, from a protein represented by SEQ ID No: 10.

A still further aspect of the present invention features an antibody preparation specifically reactive with an epitope of the E6-BP immunogen.

In another aspect, the invention features a E6-BP^(SD-16) polypeptide, preferably a substantially pure preparation of an E6-BP^(SD-16) polypeptide, or a recombinant E6-BP^(SD-16) polypeptide. In preferred embodiments: the polypeptide has biological activity, e.g., it specifically binds a papillomavirus E6 protein; the polypeptide has an amino acid sequence at least 60%, 80%, 90% or 95% homologous to the amino acid sequence in SEQ ID No: 11; the polypeptide has an amino acid sequence essentially the same as the amino acid sequence in SEQ ID No: 11; the polypeptide is at least 5, 10, 20, 50, 100, or 150 amino acids in length; the polypeptide comprises at least 5, preferably at least 10, more preferably at least 20, more preferably at least 50, 100, or 150 contiguous amino acids from SEQ ID No: 1 1; the E6-BP^(SD-16) polypeptide is either, an agonist or an antagonist of a biological activity of an E6-PB, e.g., of the regulation of cell proliferation.

In preferred embodiments the invention includes E6-binding proteins with antagonistic activity, and which preferably are capable of: suppressing tumor growth, e.g. in a tumor cell in which endogenous E6-BP is misexpressed; suppressing growth of papillomavirus-infected cells, e.g. HPV-infected cells; blocking or inducing apoptosis; inhibiting growth of a papillomavirus-infected cell, e.g. an HPV-infected cell, e.g. a high-risk HPV infected cell, e.g. and HPV-16, -18, -31, or -33 infected cell, e.g. a bovine papillomavirus (BPV)-infected cell; inhibiting infection of a cell by a papillomavirus, e.g. an HPV, e.g. a high-risk HPV, e.g. and HPV-16, -18, -31, or -33, e.g. a bovine papillomavirus (BPV); inhibiting transformation of a cell by a papillomavirus, e.g. an HPV, e.g. a high-risk HPV, e.g. and HPV- 16, -18, -31, or -33, e.g. a bovine papillomavirus; or inhibiting immortalization of a cell, e.g. a human cell, by a papillomavirus, e.g. an HPV, e.g. a high-risk HPV, e.g. and HPV-16, -18, -31, or -33, e.g. a bovine papillomavirus.

In a preferred embodiment, a peptide having at least one biological activity of the subject E6-BP^(SD-16) polypeptide may differ in amino acid sequence from the sequence in SEQ ID No: 11, but such differences result in a modified protein which functions in the same or similar manner as the native E6-binding protein or which has the same or similar characteristics of the native E6-binding protein.

In yet other preferred embodiments, E6-binding protein is a recombinant fusion protein which includes a second polypeptide portion, e.g., a second polypeptide having an amino acid sequence unrelated to a protein represented by one of SEQ ID Nos: 8-14, e.g. the second polypeptide portion is glutathione-S-transferase, e.g. the second polypeptide portion is a DNA binding domain, e.g. the second polypeptide portion is a polymerase activating domain, e.g. the fusion protein is functional in a two-hybrid assay.

Yet another aspect of the present invention concerns an immunogen comprising an E6-BP polypeptide in an immunogenic preparation, the immunogen being capable of eliciting an immune response specific for said E6-BP polypeptide; e.g. a humoral response, e.g. an antibody response; e.g. a cellular response. In preferred embodiments, the immunogen comprising an antigenic determinant, e.g. a unique determinant, from a protein represented by SEQ ID No: 11.

A still further aspect of the present invention features an antibody preparation specifically reactive with an epitope of the E6-BP immunogen.

In another aspect, the invention features a E6-BP^(SD-22) polypeptide, preferably a substantially pure preparation of an E6-BP^(SD-22) polypeptide, or a recombinant E6-BP^(SD-22) polypeptide. In preferred embodiments: the polypeptide has biological activity, e.g., it specifically binds a papillomavirus E6 protein; the polypeptide has an amino acid sequence at least 60%, 80%, 90% or 95% homologous to the amino acid sequence in SEQ ID No: 12; the polypeptide has an amino acid sequence essentially the same as the amino acid sequence in SEQ ID No: 12; the polypeptide is at least 5, 10, 20, 50, 100, or 150 amino acids in length; the polypeptide comprises at least 5, preferably at least 10, more preferably at least 20, more preferably at least 50, 100, or 150 contiguous amino acids from SEQ ID No: 12; the E6-BP^(SD-22) polypeptide is either, an agonist or an antagonist of a biological activity of an E6-PB, e.g., of the regulation of cell proliferation.

In preferred embodiments the invention includes E6-binding proteins with antagonistic activity, and which preferably are capable of: suppressing tumor growth, e.g. in a tumor cell in which endogenous E6-BP is misexpressed; suppressing growth of papillomavirus-infected cells, e.g. HPV-infected cells; blocking or inducing apoptosis; inhibiting growth of a papillomavirus-infected cell, e.g. an HPV-infected cell, e.g. a high-risk HPV infected cell, e.g. and HPV-16, -18, -31, or -33 infected cell, e.g. a bovine papillomavirus (BPV)-infected cell; inhibiting infection of a cell by a papillomavirus, e.g. an HPV, e.g. a high-risk HPV, e.g. and HPV-16, -18, -31, or -33, e.g. a bovine papillomavirus (BPV); inhibiting transformation of a cell by a papillomavirus, e.g. an HPV, e.g. a high-risk HPV, e.g. and HPV-16, -18, -31, or -33, e.g. a bovine papillomavirus; or inhibiting immortalization of a cell, e.g. a human cell, by a papillomavirus, e.g. an HPV, e.g. a high-risk HPV, e.g. and HPV-16, -18, -31, or -33, e.g. a bovine papillomavirus.

In a preferred embodiment, a peptide having at least one biological activity of the subject E6-BP^(SD-22) polypeptide may differ in amino acid sequence from the sequence in SEQ ID No: 12, but such differences result in a modified protein which functions in the same or similar manner as the native E6-binding protein or which has the same or similar characteristics of the native E6-binding protein.

In yet other preferred embodiments, E6-binding protein is a recombinant fusion protein which includes a second polypeptide portion, e.g., a second polypeptide having an amino acid sequence unrelated to a protein represented by one of SEQ ID Nos: 8-14, e.g. the second polypeptide portion is glutathione-S-transferase, e.g. the second polypeptide portion is a DNA binding domain, e.g. the second polypeptide portion is a polymerase activating domain, e.g. the fusion protein is functional in a two-hybrid assay.

Yet another aspect of the present invention concerns an immunogen comprising an E6-BP polypeptide in an immunogenic preparation, the immunogen being capable of eliciting an immune response specific for said E6-BP polypeptide; e.g. a humoral response, e.g. an antibody response; e.g. a cellular response. In preferred embodiments, the immunogen comprising an antigenic determinant, e.g. a unique determinant, from a protein represented by SEQ ID No: 12.

A still further aspect of the present invention features an antibody preparation specifically reactive with an epitope of the E6-BP immunogen.

In another aspect, the invention features a E6-BP^(SD-28) polypeptide, preferably a substantially pure preparation of an E6-BP^(SD-28) polypeptide, or a recombinant E6-BP^(SD-28) polypeptide. In preferred embodiments: the polypeptide has biological activity, e.g., it specifically binds a papillomavirus E6 protein; the polypeptide has an amino acid sequence at least 60%, 80%, 90% or 95% homologous to the amino acid sequence in SEQ ID No: 13; the polypeptide has an amino acid sequence essentially the same as the amino acid sequence in SEQ ID No: 13; the polypeptide is at least 5, 10, 20, 50, 100, or 150 amino acids in length; the polypeptide comprises at least 5, preferably at least 10, more preferably at least 20, more preferably at least 50, 100, or 150 contiguous amino acids from SEQ ID No: 13; the E6-BP^(SD-28) polypeptide is either, an agonist or an antagonist of a biological activity of an E6-PB, e.g., of the regulation of cell proliferation.

In preferred embodiments the invention includes E6-binding proteins with antagonistic activity, and which preferably are capable of: suppressing tumor growth, e.g. in a tumor cell in which endogenous E6-BP is misexpressed; suppressing growth of papillomavirus-infected cells, e.g. HPV-infected cells; blocking or inducing apoptosis; inhibiting growth of a papillomavirus-infected cell, e.g. an HPV-infected cell, e.g. a high-risk HPV infected cell, e.g. and HPV-16, -18, -31, or -33 infected cell, e.g. a bovine papillomavirus (BPV)-infected cell; inhibiting infection of a cell by a papillomavirus, e.g. an HPV, e.g. a high-risk HPV, e.g. and HPV-16, -18, -31, or -33, e.g. a bovine papillomavirus (BPV); inhibiting transformation of a cell by a papillomavirus, e.g. an HPV, e.g. a high-risk HPV, e.g. and HPV-16, -18, -31, or -33, e.g. a bovine papillomavirus; or inhibiting immortalization of a cell, e.g. a human cell, by a papillomavirus, e.g. an HPV, e.g. a high-risk HPV, e.g. and HPV-16, -18, -31, or -33, e.g. a bovine papillomavirus.

In a preferred embodiment, a peptide having at least one biological activity of the subject E6-BP^(SD-28) polypeptide may differ in amino acid sequence from the sequence in SEQ ID No: 13, but such differences result in a modified protein which functions in the same or similar manner as the native E6-binding protein or which has the same or similar characteristics of the native E6-binding protein.

In yet other preferred embodiments, E6-binding protein is a recombinant fusion protein which includes a second polypeptide portion, e.g., a second polypeptide having an amino acid sequence unrelated to a protein represented by one of SEQ ID Nos: 8-14, e.g. the second polypeptide portion is glutathione-S-transferase, e.g. the second polypeptide portion is a DNA binding domain, e.g. the second polypeptide portion is a polymerase activating domain, e.g. the fusion protein is functional in a two-hybrid assay.

Yet another aspect of the present invention concerns an immunogen comprising an E6-BP polypeptide in an immunogenic preparation, the immunogen being capable of eliciting an immune response specific for said E6-BP polypeptide; e.g. a humoral response, e.g. an antibody response; e.g. a cellular response. In preferred embodiments, the immunogen comprising an antigenic determinant, e.g. a unique determinant, from a protein represented by SEQ ID No: 13.

A still further aspect of the present invention features an antibody preparation specifically reactive with an epitope of the E6-BP immunogen.

In another aspect, the invention features a E6-BP^(SD-32) polypeptide, preferably a substantially pure preparation of an E6-BP^(SD-32) polypeptide, or a recombinant E6-BP^(SD-32) polypeptide. In preferred embodiments: the polypeptide has biological activity, e.g., it specifically binds a papillomavirus E6 protein; the polypeptide has an amino acid sequence at least 60%, 80%, 90% or 95% homologous to the amino acid sequence in SEQ ID No: 14; the polypeptide has an amino acid sequence essentially the same as the amino acid sequence in SEQ ID No: 14; the polypeptide is at least 5, 10, 20, 50, 100, or 150 amino acids in length; the polypeptide comprises at least 5, preferably at least 10, more preferably at least 20, more preferably at least 50, 100, or 150 contiguous amino acids from SEQ ID No: 14; the E6-BP^(SD-32) polypeptide is either, an agonist or an antagonist of a biological activity of an E6-PB, e.g., of the regulation of cell proliferation.

In preferred embodiments the invention includes E6-binding proteins with antagonistic activity, and which preferably are capable of: suppressing tumor growth, e.g. in a tumor cell in which endogenous E6-BP is misexpressed; suppressing growth of papillomavirus-infected cells, e.g. HPV-infected cells; blocking or inducing apoptosis; inhibiting growth of a papillomavirus-infected cell, e.g. an HPV-infected cell, e.g. a high-risk HPV infected cell, e.g. and HPV-16, -18, -31, or -33 infected cell, e.g. a bovine papillomavirus (BPV)-infected cell; inhibiting infection of a cell by a papillomavirus, e.g. an HPV, e.g. a high-risk HPV, e.g. and HPV-16, -18, -31, or -33, e.g. a bovine papillomavirus (BPV); inhibiting transformation of a cell by a papillomavirus, e.g. an HPV, e.g. a high-risk HPV, e.g. and HPV-16, -18, -31, or -33, e.g. a bovine papillomavirus; or inhibiting immortalization of a cell, e.g. a human cell, by a papillomavirus, e.g. an HPV, e.g. a high-risk HPV, e.g. and HPV-16, -18, -31, or -33, e.g. a bovine papillomavirus.

In a preferred embodiment, a peptide having at least one biological activity of the subject E6-BP^(SD-32) polypeptide may differ in amino acid sequence from the sequence in SEQ ID No: 14, but such differences result in a modified protein which functions in the same or similar manner as the native E6-binding protein or which has the same or similar characteristics of the native E6-binding protein.

In yet other preferred embodiments, E6-binding protein is a recombinant fusion protein which includes a second polypeptide portion, e.g., a second polypeptide having an amino acid sequence unrelated to a protein represented by one of SEQ ID Nos: 8-14, e.g. the second polypeptide portion is glutathione-S-transferase, e.g. the second polypeptide portion is a DNA binding domain, e.g. the second polypeptide portion is a polymerase activating domain, e.g. the fusion protein is functional in a two-hybrid assay.

Yet another aspect of the present invention concerns an immunogen comprising an E6-BP polypeptide in an immunogenic preparation, the immunogen being capable of eliciting an immune response specific for said E6-BP polypeptide; e.g. a humoral response, e.g. an antibody response; e.g. a cellular response. In preferred embodiments, the immunogen comprising an antigenic determinant, e.g. a unique determinant, from a protein represented by SEQ ID No: 14.

A still further aspect of the present invention features an antibody preparation specifically reactive with an epitope of the E6-BP immunogen.

Another aspect of the present invention provides a substantially pure nucleic acid having a nucleotide sequence which encodes an E6-BP^(SD-7) polypeptide. In preferred embodiments: the encoded polypeptide has biological activity, e.g., it specifically binds a papillomavirus E6 protein; the encoded polypeptide has an amino acid sequence at least 60%, 80%, 90% or 95% homologous to the amino acid sequence in SEQ ID No: 8; the encoded polypeptide has an amino acid sequence essentially the same as the amino acid sequence in SEQ ID No: 8; the encoded polypeptide is at least 5, 10, 20, 50, 100, or 150 amino acids in length; the encoded polypeptide comprises at least 5, preferably at least 10, more preferably at least 20, more preferably at least 50, 100, or 150 contiguous amino acids from SEQ ID No: 8; the encoded E6-BP^(SD-7) polypeptide is either, an agonist or an antagonist of a biological activity of an E6-PB, e.g., of the regulation of cell proliferation; the polypeptide includes an E6-binding motif corresponding to Ala 194-Asp 218 of SEQ. ID No. 8. In preferred embodiments, the nucleic acid includes a nucleotide sequence corresponding to nucleotide residues 580-654 of SEQ. ID No. 1.

In preferred embodiments the encoded polypeptide has antagonistic activity, and is preferably capable of: suppressing tumor growth, e.g. in a tumor cell in which endogenous E6-BP is misexpressed; suppressing growth of papillomavirus-infected cells, e.g. HPV-infected cells; blocking or inducing apoptosis; inhibiting growth of a papillomavirus-infected cell, e.g. an HPV-infected cell, e.g. a high-risk HPV infected cell, e.g. and HPV-16, -18, -31, or -33 infected cell, e.g. a bovine papillomavirus (BPV)-infected cell; inhibiting infection of a cell by a papillomavirus, e.g. an HPV, e.g. a high-risk HPV, e.g. and HPV-16, -18, -31, or -33, e.g. a bovine papillomavirus (BPV); inhibiting transformation of a cell by a papillomavirus, e.g. an HPV, e.g. a high-risk HPV, e.g. and HPV-16, -18, -31, or -33, e.g. a bovine papillomavirus; or inhibiting immortalization of a cell, e.g. a human cell, by a papillomavirus, e.g. an HPV, e.g. a high-risk HPV, e.g. and HPV-16, -18, -31, or -33, e.g. a bovine papillomavirus.

In a preferred embodiment, the encoded peptide having at least one biological activity of the subject E6-BP^(SD-7) polypeptide may differ in amino acid sequence from the sequence in SEQ ID No: 8, but such differences result in a modified protein which functions in the same or similar manner as the native E6-binding protein or which has the same or similar characteristics of the native E6-binding protein.

In yet other preferred embodiments, the encoded polypeptide is a recombinant fusion protein which includes a second polypeptide portion, e.g., a second polypeptide having an amino acid sequence unrelated to a protein represented by one of SEQ ID Nos: 8-14, e.g. the second polypeptide portion is glutathione-S-transferase, e.g. the second polypeptide portion is a DNA binding domain, e.g. the second polypeptide portion is a polymerase activating domain, e.g. the fusion protein is functional in a two-hybrid assay.

Furthermore, in certain preferred embodiments, the subject E6-BP^(SD-7) nucleic acid will include a transcriptional regulatory sequence, e.g. at least one of a transcriptional promoter or transcriptional enhancer sequence, operably linked to the E6-BP^(SD-7) gene sequence, e.g., to render the E6-BP^(SD-7) gene sequence suitable for use as an expression vector.

In yet a further preferred embodiment, the nucleic acid which encodes an E6-BP^(SD-7) polypeptide of the invention, which hybridizes under stringent conditions to a nucleic acid probe corresponding to at least 12 consecutive nucleotides of SEQ ID No: 1; more preferably to at least 20 consecutive nucleotides of SEQ ID No: 1; more preferably to at least 40 consecutive nucleotides of SEQ ID No: 1. In yet a further preferred embodiment, the E6-BP encoding nucleic acid hybridizes to a nucleic acid probe corresponding to a subsequence encoding at least 4 consecutive amino acids, more preferably at least 10 consecutive amino acid residues, and even more preferably at least 20 amino acid residues between residues 1 and 133 of SEQ ID No: 8.

In preferred embodiments: the nucleic acid sequence includes at least 1, 2, 3 or 5, and preferably at least 10, 20, 50, or 100 nucleotides from the region of SEQ ID No: 1 which encodes amino acid residues 1-133 of SEQ ID No: 8; the encoded peptide includes at least 1, 2, 3, 5, 10, 20, or 30 amino acid residues from amino acid residues 1-133 of SEQ ID No: 8; the nucleic acid sequence is other than nucleotide residues 572-875 of SEQ ID No: 1, e.g., it is longer, shorter, has a different 3' end or a different 5' end. In preferred embodiments, the nucleic acid nevertheless includes a coding sequence for an E6-binding motif, such as corresponding to the E6-binding motif encoded by nucleotide residues 580-654 at SEQ. ID No. 1.

Another aspect of the present invention provides a substantially pure nucleic acid having a nucleotide sequence which encodes an E6-BP^(SD-8) polypeptide. In preferred embodiments: the encoded polypeptide has biological activity, e.g., it specifically binds a papillomavirus E6 protein; the encoded polypeptide has an amino acid sequence at least 60%, 80%, 90% or 95% homologous to the amino acid sequence in SEQ ID No: 9; the encoded polypeptide has an amino acid sequence essentially the same as the amino acid sequence in SEQ ID No: 9; the encoded polypeptide is at least 5, 10, 20, 50, 100, or 150 amino acids in length; the encoded polypeptide comprises at least 5, preferably at least 10, more preferably at least 20, more preferably at least 50, 100, or 150 contiguous amino acids from SEQ ID No: 9; the encoded E6-BP^(SD-8) polypeptide is either, an agonist or an antagonist of a biological activity of an E6-PB, e.g., of the regulation of cell proliferation.

In preferred embodiments the encoded polypeptide has antagonistic activity, and is preferably capable of: suppressing tumor growth, e.g. in a tumor cell in which endogenous E6-BP is misexpressed; suppressing growth of papillomavirus-infected cells, e.g. HPV-infected cells; blocking or inducing apoptosis; inhibiting growth of a papillomavirus-infected cell, e.g. an HPV-infected cell, e.g. a high-risk HPV infected cell, e.g. and HPV- 16, -18, -31, or -33 infected cell, e.g. a bovine papillomavirus (BPV)-infected cell; inhibiting infection of a cell by a papillomavirus, e.g. an HPV, e.g. a high-risk HPV, e.g. and HPV-16, -18, -31, or -33, e.g. a bovine papillomavirus (BPV); inhibiting transformation of a cell by a papillomavirus, e.g. an HPV, e.g. a high-risk HPV, e.g. and HPV-1 6, -18, -31, or -33, e.g. a bovine papillomavirus; or inhibiting immortalization of a cell, e.g. a human cell, by a papillomavirus, e.g. an HPV, e.g. a high-risk HPV, e.g. and HPV-16, -18, -31, or -33, e.g. a bovine papillomavirus.

In a preferred embodiment, the encoded peptide having at least one biological activity of the subject E6-BP^(SD-8) polypeptide may differ in amino acid sequence from the sequence in SEQ ID No: 9, but such differences result in a modified protein which functions in the same or similar manner as the native E6-binding protein or which has the same or similar characteristics of the native E6-binding protein.

In yet other preferred embodiments, the encoded polypeptide is a recombinant fusion protein which includes a second polypeptide portion, e.g., a second polypeptide having an amino acid sequence unrelated to a protein represented by one of SEQ ID Nos: 8-14, e.g. the second polypeptide portion is glutathione-S-transferase, e.g. the second polypeptide portion is a DNA binding domain, e.g. the second polypeptide portion is a polymerase activating domain, e.g. the fusion protein is functional in a two-hybrid assay.

Furthermore, in certain preferred embodiments, the subject E6-BP^(SD-8) nucleic acid will include a transcriptional regulatory sequence, e.g. at least one of a transcriptional promoter or transcriptional enhancer sequence, operably linked to the E6-BP^(SD-8) gene sequence, e.g., to render the E6-BP^(SD-8) gene sequence suitable for use as an expression vector.

In yet a further preferred embodiment, the nucleic acid which encodes an E6-BP polypeptide of the invention, which hybridizes under stringent conditions to a nucleic acid probe corresponding to at least 12 consecutive nucleotides of SEQ ID No: 2; more preferably to at least 20 consecutive nucleotides of SEQ ID No: 2; more preferably to at least 40 consecutive nucleotides of SEQ ID No: 2.

Another aspect of the present invention provides a substantially pure nucleic acid having a nucleotide sequence which encodes an E6-BP^(SD-12) polypeptide. In preferred embodiments: the encoded polypeptide has biological activity, e.g., it specifically binds a papillomavirus E6 protein; the encoded polypeptide has an amino acid sequence at least 60%, 80%, 90% or 95% homologous to the amino acid sequence in SEQ ID No: 10; the encoded polypeptide has an amino acid sequence essentially the same as the amino acid sequence in SEQ ID No: 10; the encoded polypeptide is at least 5, 10, 20, 50, 100, or 150 amino acids in length; the encoded polypeptide comprises at least 5, preferably at least 10, more preferably at least 20, more preferably at least 50, 100, or 150 contiguous amino acids from SEQ ID No: 10; the encoded E6-BP^(SD-12) polypeptide is either, an agonist or an antagonist of a biological activity of an E6-PB, e.g., of the regulation of cell proliferation.

In preferred embodiments the encoded polypeptide has antagonistic activity, and is preferably capable of: suppressing tumor growth, e.g. in a tumor cell in which endogenous E6-BP is misexpressed; suppressing growth of papillomavirus-infected cells, e.g. HPV-infected cells; blocking or inducing apoptosis; inhibiting growth of a papillomavirus-infected cell, e.g. an HPV-infected cell, e.g. a high-risk HPV infected cell, e.g. and HPV- 16, -18, -31, or -33 infected cell, e.g. a bovine papillomavirus (BPV)-infected cell; inhibiting infection of a cell by a papillomavirus, e.g. an HPV, e.g. a high-risk HPV, e.g. and HPV- 16, -18, -31, or -33, e.g. a bovine papillomavirus (BPV); inhibiting transformation of a cell by a papillomavirus, e.g. an HPV, e.g. a high-risk HPV, e.g. and HPV-16, -18, -31, or -33, e.g. a bovine papillomavirus; or inhibiting immortalization of a cell, e.g. a human cell, by a papillomavirus, e.g. an HPV, e.g. a high-risk HPV, e.g. and HPV-16, -18, -31, or -33, e.g. a bovine papillomavirus.

In a preferred embodiment, the encoded peptide having at least one biological activity of the subject E6-BP^(SD-12) polypeptide may differ in amino acid sequence from the sequence in SEQ ID No: 10, but such differences result in a modified protein which functions in the same or similar manner as the native E6-binding protein or which has the same or similar characteristics of the native E6-binding protein.

In yet other preferred embodiments, the encoded polypeptide is a recombinant fusion protein which includes a second polypeptide portion, e.g., a second polypeptide having an amino acid sequence unrelated to a protein represented by one of SEQ ID Nos: 8-14, e.g. the second polypeptide portion is glutathione-S-transferase, e.g. the second polypeptide portion is a DNA binding domain, e.g. the second polypeptide portion is a polymerase activating domain, e.g. the fusion protein is functional in a two-hybrid assay.

Furthermore, in certain preferred embodiments, the subject E6-BP^(SD-12) nucleic acid will include a transcriptional regulatory sequence, e.g. at least one of a transcriptional promoter or transcriptional enhancer sequence, operably linked to the E6-BP^(SD-12) gene sequence, e.g., to render the E6-BP^(SD-12) gene sequence suitable for use as an expression vector.

In yet a further preferred embodiment, the nucleic acid which encodes an E6-BP polypeptide of the invention, which hybridizes under stringent conditions to a nucleic acid probe corresponding to at least 12 consecutive nucleotides of SEQ ID No: 3; more preferably to at least 20 consecutive nucleotides of SEQ ID No: 3; more preferably to at least 40 consecutive nucleotides of SEQ ID No: 3.

Another aspect of the present invention provides a substantially pure nucleic acid having a nucleotide sequence which encodes an E6-BP^(SD-16) polypeptide. In preferred embodiments: the encoded polypeptide has biological activity, e.g., it specifically binds a papillomavirus E6 protein; the encoded polypeptide has an amino acid sequence at least 60%, 80%, 90% or 95% homologous to the amino acid sequence in SEQ ID No: 11; the encoded polypeptide has an amino acid sequence essentially the same as the amino acid sequence in SEQ ID No: 11; the encoded polypeptide is at least 5, 10, 20, 50, 100, or 150 amino acids in length; the encoded polypeptide comprises at least 5, preferably at least 10, more preferably at least 20, more preferably at least 50, 100, or 150 contiguous amino acids from SEQ ID No: 11; the encoded E6-BP^(SD-16) polypeptide is either, an agonist or an antagonist of a biological activity of an E6-PB, e.g., of the regulation of cell proliferation.

In preferred embodiments the encoded polypeptide has antagonistic activity, and is preferably capable of: suppressing tumor growth, e.g. in a tumor cell in which endogenous E6-BP is misexpressed; suppressing growth of papillomavirus-infected cells, e.g. HPV-infected cells; blocking or inducing apoptosis; inhibiting growth of a papillomavirus-infected cell, e.g. an HPV-infected cell, e.g. a high-risk HPV infected cell, e.g. and HPV-16, -18, -31, or -33 infected cell, e.g. a bovine papillomavirus (BPV)-infected cell; inhibiting infection of a cell by a papillomavirus, e.g. an HPV, e.g. a high-risk HPV, e.g. and HPV- 16, -18, -31, or -33, e.g. a bovine papillomavirus (BPV); inhibiting transformation of a cell by a papillomavirus, e.g. an HPV, e.g. a high-risk HPV, e.g. and HPV-16, -18, -31, or -33, e.g. a bovine papillomavirus; or inhibiting immortalization of a cell, e.g. a human cell, by a papillomavirus, e.g. an HPV, e.g. a high-risk HPV, e.g. and HPV-16, -18, -31, or -33, e.g. a bovine papillomavirus.

In a preferred embodiment, the encoded peptide having at least one biological activity of the subject E6-BP^(SD-16) polypeptide may differ in amino acid sequence from the sequence in SEQ ID No: 11, but such differences result in a modified protein which functions in the same or similar manner as the native E6-binding protein or which has the same or similar characteristics of the native E6-binding protein.

In yet other preferred embodiments, the encoded polypeptide is a recombinant fusion protein which includes a second polypeptide portion, e.g., a second polypeptide having an amino acid sequence unrelated to a protein represented by one of SEQ ID Nos: 8-14, e.g. the second polypeptide portion is glutathione-S-transferase, e.g. the second polypeptide portion is a DNA binding domain, e.g. the second polypeptide portion is a polymerase activating domain, e.g. the fusion protein is functional in a two-hybrid assay.

Furthermore, in certain preferred embodiments, the subject E6-BP^(SD-16) nucleic acid will include a transcriptional regulatory sequence, e.g., at least one of a transcriptional promoter or transcriptional enhancer sequence, operably linked to the E6-BP^(SD-16) gene sequence, e.g., to render the E6-BP^(SD-16) gene sequence suitable for use as an expression vector.

In yet a further preferred embodiment, the nucleic acid which encodes an E6-BP polypeptide of the invention, which hybridizes under stringent conditions to a nucleic acid probe corresponding to at least 12 consecutive nucleotides of SEQ ID No: 4; more preferably to at least 20 consecutive nucleotides of SEQ ID No: 4; more preferably to at least 40 consecutive nucleotides of SEQ ID No: 4.

Another aspect of the present invention provides a substantially pure nucleic acid having a nucleotide sequence which encodes an E6-BP^(SD-22) polypeptide. In preferred embodiments: the encoded polypeptide has biological activity, e.g., it specifically binds a papillomavirus E6 protein; the encoded polypeptide has an amino acid sequence at least 60%, 80%, 90% or 95% homologous to the amino acid sequence in SEQ ID No: 12; the encoded polypeptide has an amino acid sequence essentially the same as the amino acid sequence in SEQ ID No: 12; the encoded polypeptide is at least 5, 10, 20, 50, 100, or 150 amino acids in length; the encoded polypeptide comprises at least 5, preferably at least 10, more preferably at least 20, more preferably at least 50, 100, or 150 contiguous amino acids from SEQ ID No: 12; the encoded E6-BP^(SD-22) polypeptide is either, an agonist or an antagonist of a biological activity of an E6-PB, e.g., of the regulation of cell proliferation.

In preferred embodiments the encoded polypeptide has antagonistic activity, and is preferably capable of: suppressing tumor growth, e.g. in a tumor cell in which endogenous E6-BP is misexpressed; suppressing growth of papillomavirus-infected cells, e.g. HPV-infected cells; blocking or inducing apoptosis; inhibiting growth of a papillomavirus-infected cell, e.g. an HPV-infected cell, e.g. a high-risk HPV infected cell, e.g. and HPV-16, -18, -31, or -33 infected cell, e.g. a bovine papillomavirus (BPV)-infected cell; inhibiting infection of a cell by a papillomavirus, e.g. an HPV, e.g. a high-risk HPV, e.g. and HPV-16, -18, -31, or -33, e.g. a bovine papillomavirus (BPV); inhibiting transformation of a cell by a papillomavirus, e.g. an HPV, e.g. a high-risk HPV, e.g. and HPV-16, -18, -31, or -33, e.g. a bovine papillomavirus; or inhibiting immortalization of a cell, e.g. a human cell, by a papillomavirus, e.g. an HPV, e.g. a high-risk HPV, e.g. and HPV-16, -18, -31, or -33, e.g. a bovine papillomavirus.

In a preferred embodiment, the encoded peptide having at least one biological activity of the subject E6-BP^(SD-22) polypeptide may differ in amino acid sequence from the sequence in SEQ ID No: 12, but such differences result in a modified protein which functions in the same or similar manner as the native E6-binding protein or which has the same or similar characteristics of the native E6-binding protein.

In yet other preferred embodiments, the encoded polypeptide is a recombinant fusion protein which includes a second polypeptide portion, e.g., a second polypeptide having an amino acid sequence unrelated to a protein represented by one of SEQ ID Nos: 8-14, e.g. the second polypeptide portion is glutathione-S-transferase, e.g. the second polypeptide portion is a DNA binding domain, e.g. the second polypeptide portion is a polymerase activating domain, e.g. the fusion protein is functional in a two-hybrid assay.

Furthermore, in certain preferred embodiments, the subject E6-BP^(SD-22) nucleic acid will include a transcriptional regulatory sequence, e.g. at least one of a transcriptional promoter or transcriptional enhancer sequence, operably linked to the E6-BP^(SD-22) gene sequence, e.g., to render the E6-BP^(SD-22) gene sequence suitable for use as an expression vector.

In yet a further preferred embodiment, the nucleic acid which encodes an E6-BP polypeptide of the invention, which hybridizes under stringent conditions to a nucleic acid probe corresponding to at least 12 consecutive nucleotides of SEQ ID No: 5; more preferably to at least 20 consecutive nucleotides of SEQ ID No: 5; more preferably to at least 40 consecutive nucleotides of SEQ ID No: 5.

Another aspect of the present invention provides a substantially pure nucleic acid having a nucleotide sequence which encodes an E6-BP^(SD-28) polypeptide. In preferred embodiments: the encoded polypeptide has biological activity, e.g., it specifically binds a papillomavirus E6 protein; the encoded polypeptide has an amino acid sequence at least 60%, 80%, 90% or 95% homologous to the amino acid sequence in SEQ ID No: 13; the encoded polypeptide has an amino acid sequence essentially the same as the amino acid sequence in SEQ ID No: 13; the encoded polypeptide is at least 5, 10, 20, 50, 100, or 150 amino acids in length; the encoded polypeptide comprises at least 5, preferably at least 10, more preferably at least 20, more preferably at least 50, 100, or 150 contiguous amino acids from SEQ ID No: 13; the encoded E6-BP^(SD-28) polypeptide is either, an agonist or an antagonist of a biological activity of an E6-PB, e.g., of the regulation of cell proliferation.

In preferred embodiments the encoded polypeptide has antagonistic activity, and is preferably capable of: suppressing tumor growth, e.g. in a tumor cell in which endogenous E6-BP is misexpressed; suppressing growth of papillomavirus-infected cells, e.g. HPV-infected cells; blocking or inducing apoptosis; inhibiting growth of a papillomavirus-infected cell, e.g. an HPV-infected cell, e.g. a high-risk HPV infected cell, e.g. and HPV-16, -18, -31, or -33 infected cell, e.g. a bovine papillomavirus (BPV)-infected cell; inhibiting infection of a cell by a papillomavirus, e.g. an HPV, e.g. a high-risk HPV, e.g. and HPV-16, -18, -31, or -33, e.g. a bovine papillomavirus (BPV); inhibiting transformation of a cell by a papillomavirus, e.g. an HPV, e.g. a high-risk HPV, e.g. and HPV-16, -18, -31, or -33, e.g. a bovine papillomavirus; or inhibiting immortalization of a cell, e.g. a human cell, by a papillomavirus, e.g. an HPV, e.g. a high-risk HPV, e.g. and HPV-16, -18, -31, or -33, e.g. a bovine papillomavirus.

In a preferred embodiment, the encoded peptide having at least one biological activity of the subject E6-BP^(SD-28) polypeptide may differ in amino acid sequence from the sequence in SEQ ID No: 13, but such differences result in a modified protein which functions in the same or similar manner as the native E6-binding protein or which has the same or similar characteristics of the native E6-binding protein.

In yet other preferred embodiments, the encoded polypeptide is a recombinant fusion protein which includes a second polypeptide portion, e.g., a second polypeptide having an amino acid sequence unrelated to a protein represented by one of SEQ ID Nos: 8-14, e.g. the second polypeptide portion is glutathione-S-transferase, e.g. the second polypeptide portion is a DNA binding domain, e.g. the second polypeptide portion is a polymerase activating domain, e.g. the fusion protein is functional in a two-hybrid assay.

Furthermore, in certain preferred embodiments, the subject E6-BP^(SD-28) nucleic acid will include a transcriptional regulatory sequence, e.g. at least one of a transcriptional promoter or transcriptional enhancer sequence, operably linked to the E6-BP^(SD-28) gene sequence, e.g., to render the E6-BP^(SD-28) gene sequence suitable for use as an expression vector.

In yet a further preferred embodiment, the nucleic acid which encodes an E6-BP polypeptide of the invention, which hybridizes under stringent conditions to a nucleic acid probe corresponding to at least 12 consecutive nucleotides of SEQ ID No: 6; more preferably to at least 20 consecutive nucleotides of SEQ ID No: 6; more preferably to at least 40 consecutive nucleotides of SEQ ID No: 6.

Another aspect of the present invention provides a substantially pure nucleic acid having a nucleotide sequence which encodes an E6-BP^(SD-32) polypeptide. In preferred embodiments: the encoded polypeptide has biological activity, e.g., it specifically binds a papillomavirus E6 protein; the encoded polypeptide has an amino acid sequence at least 60%, 80%, 90% or 95% homologous to the amino acid sequence in SEQ ID No: 14; the encoded polypeptide has an amino acid sequence essentially the same as the amino acid sequence in SEQ ID No: 14; the encoded polypeptide is at least 5, 10, 20, 50, 100, or 150 amino acids in length; the encoded polypeptide comprises at least 5, preferably at least 10, more preferably at least 20, more preferably at least 50, 100, or 150 contiguous amino acids from SEQ ID No: 14; the encoded E6-BP^(SD-32) polypeptide is either, an agonist or an antagonist of a biological activity of an E6-PB, e.g., of the regulation of cell proliferation.

In preferred embodiments the encoded polypeptide has antagonistic activity, and is preferably capable of: suppressing tumor growth, e.g. in a tumor cell in which endogenous E6-BP is misexpressed; suppressing growth of papillomavirus-infected cells, e.g. HPV-infected cells; blocking or inducing apoptosis; inhibiting growth of a papillomavirus-infected cell, e.g. an HPV-infected cell, e.g. a high-risk HPV infected cell, e.g. and HPV- 16, -18, -31, or -33 infected cell, e.g. a bovine papillomavirus (BPV)-infected cell; inhibiting infection of a cell by a papillomavirus, e.g. an HPV, e.g. a high-risk HPV, e.g. and HPV- 16, -18, -31, or -33, e.g. a bovine papillomavirus (BPV); inhibiting transformation of a cell by a papillomavirus, e.g. an HPV, e.g. a high-risk HPV, e.g. and HPV-16, -18, -31, or -33, e.g. a bovine papillomavirus; or inhibiting immortalization of a cell, e.g. a human cell, by a papillomavirus, e.g. an HPV, e.g. a high-risk HPV, e.g. and HPV-16, -18, -31, or -33, e.g. a bovine papillomavirus.

In a preferred embodiment, the encoded peptide having at least one biological activity of the subject E6-BP^(SD-32) polypeptide may differ in amino acid sequence from the sequence in SEQ ID No: 14, but such differences result in a modified protein which functions in the same or similar manner as the native E6-binding protein or which has the same or similar characteristics of the native E6-binding protein.

In yet other preferred embodiments, the encoded polypeptide is a recombinant fusion protein which includes a second polypeptide portion, e.g., a second polypeptide having an amino acid sequence unrelated to a protein represented by one of SEQ ID Nos: 8-14, e.g. the second polypeptide portion is glutathione-S-transferase, e.g. the second polypeptide portion is a DNA binding domain, e.g. the second polypeptide portion is a polymerase activating domain, e.g. the fusion protein is functional in a two-hybrid assay.

Furthermore, in certain preferred embodiments, the subject E6-BP^(SD-32) nucleic acid will include a transcriptional regulatory sequence, e.g. at least one of a transcriptional promoter or transcriptional enhancer sequence, operably linked to the E6-BP^(SD-32) gene sequence, e.g., to render the E6-BP^(SD-32) gene sequence suitable for use as an expression vector.

In yet a further preferred embodiment, the nucleic acid which encodes an E6-BP polypeptide of the invention, which hybridizes under stringent conditions to a nucleic acid probe corresponding to at least 12 consecutive nucleotides of SEQ ID No: 7; more preferably to at least 20 consecutive nucleotides of SEQ ID No: 7; more preferably to at least 40 consecutive nucleotides of SEQ ID No: 7.

The invention also features transgenic non-human animals, e.g. mice, rats, rabbits or pigs, having a transgene, e.g., animals which include (and preferably express) a heterologous form of one of the novel E6-BP genes described herein, e.g. a gene derived from humans, or a gene which misexpress an endogenous E6-BP gene, e.g., an animal which expression of one or more of the subject E6-BP's is disrupted. Such a transgenic animal can serve as an animal model for studying cellular disorders comprising mutated or mis-expressed E6-BP alleles or for use in drug screening.

The invention also provides a probe/primer comprising a substantially purified oligonucleotide, wherein the oligonucleotide comprises a region of nucleotide sequence which hybridizes under stringent conditions to at least 10 consecutive nucleotides of sense or antisense sequence of one of SEQ ID Nos: 1-7, or naturally occurring mutants thereof. In preferred embodiments, the probe/primer further includes a label group attached thereto and able to be detected. The label group can be selected, e.g., from a group consisting of radioisotopes, fluorescent compounds, enzymes, and enzyme co-factors. Probes of the invention can be used as a part of a diagnostic test kit for identifying transformed cells, such as for measuring in a sample of cells isolated from a patient, a level of a nucleic acid encoding one of the subject E6-binding proteins; e.g. measuring the E6-BP mRNA level in a cell; e.g. determining whether the genomic E6-BP gene has been mutated or deleted. Preferably the oligonucleotide is at least 10 and less than 20, 30, 50, 100, or 150 nucleotides in length.

In yet another aspect, the invention provides an assay for screening test compounds for an interaction; e.g., inhibition, of an E6-BP polypeptide with a papillomavirus E6 protein. The method includes the steps of (i) combining a viral E6 protein, an E6-BP, e.g., an E6-BP of the invention (e.g. a protein expressed from one of the clones selected from the group SD-7, SD-8, SD-12, SD-1 6, SD-22, SD-28 or SD-32, or keratin-17, apoferritin, a nucleophosamin, a ribonucleoprotein, a proteasome subunit, a complement decay-accelerating factor), and a test compound, e.g., under conditions wherein in the absence of the test compound the E6 protein and the E6-binding protein are able to interact; and (ii) detecting the formation of a complex which includes the E6 protein and the E6-binding protein. A change, e.g., a decrease, in the formation of the complex in the presence of a test compound (relative to what is seen in the absence of the test compound) is indicative of a modulation, e.g., an inhibition, of the interaction between the E6 protein and the E6-binding protein. In preferred embodiments: the E6 protein is an HPV E6 protein, e.g. from a high-risk HPV, e.g. from HPV-16, -18, -31 or -33; the E6 protein is a BPV E6 protein; the E6 protein and the E6-binding protein are combined in a cell-free system and contacted with the test compound; i.e. the cell-free system is selected from a group consisting of a cell lysate and a reconstituted protein mixture; the E6 protein and the E6-binding protein are simultaneously expressed in a cell, and the cell is contacted with the test compound, e.g. the E6 protein and the E6-binding protein comprise an interaction trap assay (two hybrid assay). In preferred embodiments, the assay is generated using a protein including an E6-binding motif, e.g. an E6-binding motif which corresponds to about Ala 194-Asp 218.

The present invention also provides a method for treating an animal having unwanted cell growth characterized by a loss of wild-type function of one or more of the subject E6-binding proteins, comprising administering a therapeutically effective amount of an agent able to inhibit the interaction of the E6-binding protein with other cellular or viral proteins. In one embodiment, the method comprises administering a nucleic acid construct encoding a polypeptides represented in one of SEQ ID Nos: 8-14, under conditions wherein the construct is incorporated by cells deficient in that E6-binding protein, and under conditions wherein the recombinant gene is expressed, e.g. by gene therapy techniques. In another embodiment, antagonistic fragments of an E6-binding protein can be provided in a pharmaceutical preparation and used to treat such disorders. For example, antagonistic forms of SD-7, such as may include an E6-binding motif, can be used to treat papillomavirus-infected and/or transformed cells by cutaneous application of the polypeptide, preferably small polypeptides, e.g., 2-50 amino acids in total length.

Another aspect of the present invention provides a method of determining if a subject, e.g. a human patient, is at risk for a disorder characterized by unwanted cell proliferation. The method includes detecting, in a tissue of the subject, the presence or absence of a genetic lesion characterized by at least one of (i) a mutation of a gene encoding a protein represented by one of SEQ ID Nos: 8-14, or a homolog thereof; or (ii) the mis-expression of a gene encoding a protein represented by one of SEQ ID Nos: 8-14. In preferred embodiments: detecting the genetic lesion includes ascertaining the existence of at least one of: a deletion of one or more nucleotides from the E6-BP gene; an addition of one or more nucleotides to the gene, an substitution of one or more nucleotides of the gene, a gross chromosomal rearrangement of the gene; an alteration in the level of a messenger RNA transcript of the gene; the presence of a non-wild type splicing pattern of a messenger RNA transcript of the gene; or a non-wild type level of the protein.

For example, detecting the genetic lesion can include (i) providing a probe/primer including an oligonucleotide containing a region of nucleotide sequence which hybridizes to a sense or antisense sequence of one of SEQ ID Nos: 1-7, or naturally occurring mutants thereof or 5' or 3' flanking sequences naturally associated with the E6-BP gene; (ii) exposing the probe/primer to nucleic acid of the tissue; and (iii) detecting, by hybridization of the probe/primer to the nucleic acid, the presence or absence of the genetic lesion; e.g. wherein detecting the lesion comprises utilizing the probe/primer to determine the nucleotide sequence of the E6-BP gene and, optionally, of the flanking nucleic acid sequences; e.g. wherein detecting the lesion comprises utilizing the probe/primer in a polymerase chain reaction (PCR); e.g. wherein detecting said lesion comprises utilizing the probe/primer in a ligation chain reaction (LCR). In alternate embodiments, the level of said protein is detected in an immunoassay using an antibody which is specifically immunoreactive with, e.g. a protein represented by one of SEQ ID Nos: 8-14.

The practice of the present invention will employ, unless otherwise indicated, conventional techniques of cell biology, cell culture, molecular biology, transgenic biology, microbiology, recombinant DNA, and immunology, which are within the skill of the art. Such techniques are explained fully in the literature. See, for example, Molecular Cloning A Laboratory Manual, 2nd Ed., ed. by Sambrook, Fritsch and Maniatis (Cold Spring Harbor Laboratory Press: 1989); DNA Cloning, Volumes I and II (D. N. Glover ed., 1985); Oligonucleotide Synthesis (M. J. Gait ed., 1984); Mullis et al. U.S. Pat. No: 4,683,195; Nucleic Acid Hybridization (B. D. Hames & S. J. Higgins eds. 1984); Transcription And Translation (B. D. Hames & S. J. Higgins eds. 1984); Culture Of Animal Cells (R. I. Freshney, Alan R. Liss, Inc., 1987); Immobilized Cells And Enzymes (IRL Press, 1986); B. Perbal, A Practical Guide To Molecular Cloning (1984); the treatise, Methods In Enzymology (Academic Press, Inc., N.Y.); Gene Transfer Vectors For Mammalian Cells (J. H. Miller and M. P. Calos eds., 1987, Cold Spring Harbor Laboratory); Methods In Enzymology, Vols. 154 and 155 (Wu et al. eds.), Immunochemical Methods In Cell And Molecular Biology (Mayer and Walker, eds., Academic Press, London, 1987); Handbook Of Experimental Immunology, Volumes I-IV (D. M. Weir and C. C. Blackwell, eds., 1986); Manipulating the Mouse Embryo, (Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1986).

Other features and advantages of the invention will be apparent from the following detailed description, and from the claims.

DESCRIPTION OF THE DRAWINGS

FIG. 1 is the general map of the pRS306 plasmid (Sikorski et al. (1989) Genetics 122:19) used to construct the VP 1 6/cDNA fusion gene for expression in a two hybrid assay.

FIGS. 2 and 3 are tables showing the effect on E6 binding of various deletion and truncations of the SD-7 protein.

DETAILED DESCRIPTION OF THE INVENTION

The papillomaviruses (PV) are infectious agents that can cause benign epithelial tumors, or warts, in their natural hosts. Of particular relevance to the field of human cancer, infection with specific human papillomaviruses (HPV) has been associated with the development of human epithelial malignancies, including that of the uterine cervix, genitalia, skin and less frequently, other sites. Two of the transforming proteins produced by papillomaviruses, the E6 protein and E7 protein, form complexes with the tumor suppressor gene products p53 and Rb, respectively, indicating that these viral proteins may exert their functions through critical pathways that regulate cellular growth control.

However, our studies of immortalization by the papillomavirus E6 protein show that while p53 binding is necessary for efficient immortalization, the E6 protein has additional properties in the establishment of an immortal state. Moreover, our findings suggest both BPV and "low risk" HPV E6 proteins have one or more functions in common with the high risk HPV E6 protein. We have adapted the two hybrid assay (U.S. Pat. No. 5,283,173) to identify other human cellular proteins which interact with the viral E6 oncoprotein and which could be candidate proteins participating in PV infectivity and/or transformation.

Starting with a yeast strain expressing the HPV-16 E6 gene fused at its C-terminus to the BPV E2 DNA-binding domain, and further containing a lacZ reporter construct driven by a promoter containing four E2 binding elements, we generated a two hybrid assay for screening human cDNA libraries. This strain was then transformed with a library of yeast shuttle vector plasmid in which randomly primed HeLa cell cDNA was inserted C-terminal to the strong VP 16 transcription activation domain. Interaction of the VP16/cDNA fusion proteins with the E6/E2 hybrid protein would recruit the VP16 transcriptional activation domain to the E2 binding site and activate expression of the lacZ gene. This has led to the successful isolation of a number of human genes which encode proteins that specifically interact with E6. The invention, as described below, therefore derives, in part, from the discovery that, in addition to the tumor suppressor protein "p53" and the cellular protein "E6-AP", the papillomavirus transforming protein E6 is also associated with several other cellular proteins (hereinafter termed "cellular E6-binding proteins" or "E6-BPs"), which association is presumably important to the pathogenesis of papillomavirus infection and papillomavirus-mediated disease states. For example, association of one of the subject E6-binding proteins with E6 can result in alteration of the localization of either or both proteins, a change in the biological activity of the protein, a modification of the cellular half-life of the protein, or a combination thereof. Thus, embodiments of the invention make available diagnostic and therapeutic assays and reagents for detecting and treating papillomavirus-infected cells.

For example, each of the subject E6-BPs can be used as the basis of assays for identifying agents that alter, e.g. decrease, the ability of a particular E6-binding protein to bind a papillomavirus E6 protein and thereby, through inhibition of E6-BP/E6 complexes, inhibit papillomavirus infection, transformation and/or immortalization. Such agents can be of use therapeutically to prevent E6-BP/E6 complexes in cells infected by, for example, human papillomaviruses, e.g. HPV-1, HPV-2, HPV-3, HPV-4, HPV-5, HPV-6, HPV-7, HPV-8, HPV-9, HPV-10, HPV-1 1, HPV-12, HPV-14, HPV-13, HPV-15, HPV-16, HPV-17 or HPV- 18, particularly high-risk HPVs, such as HPV- 16, HPV- 18, HPV-3 1 and HPV-3 3. Contacting such cells with agents that alter the formation of one or more E6-BP/E6 complexes can inhibit pathological progression of papillomavirus infection, such as preventing or reversing the formation of warts, e.g. Plantar warts (verruca plantaris), common warts (verruca plana), Butcher's common warts, flat warts, genital warts (condyloma acuminatum), or epidermodysplasia verruciformis; as well as treating papillomavirus cells which have become, or are at risk of becoming, transformed and/or immortalized, e.g. cancerous, e.g. a laryngeal papilloma, a focal epithelial, a cervical carcinoma.

In one embodiment, the cellular E6-binding protein is a cytokeratin, and binding of the PV E6 protein to the cytokeratin may contribute to, for example, collapse of the cytokeratin matrix, disruption of the cell envelope, disruption of the spatial organization of the infected epithelial tissue, and/or loss of focal adhesion by the PV-infected cells. In a preferred embodiment, the cytokeratin is keratin-17. Disruption of the E6/cytokeratin interaction might therefore influence infectivity of papillomavirus, e.g. by affecting release of viral particles from infected cells, as well as, by preventing PV-induced alteration of epithelial phenotype, e.g. of keratinocytes, e.g. of squamous epithelium, e.g. of stratified squamous epithelium.

In another embodiment, the cellular E6-binding protein is a nucleolar protein involved in cell proliferation, such as for example, transcriptional regulation, mRNA processing, mRNA localization, or ribosome maturation, and binding of papillomavirus E6 to the nucleolar protein results in alteration of one of these functions, E6 nucleolar localization and/or alteration of nucleolar organization. In a preferred embodiment, the nucleolar protein is a nucleophosmin, e.g. having a sequence given by GenBank accession number X16934.

In yet another embodiment, the cellular E6-binding protein is an apoferritin, preferably an apoferritin having the sequence provided by GenBank accession number X00318. The binding of E6 to the apoferritin protein could, for example, alter the regulation of oxidative events in the PV-infected cells, e.g. altering the ability of the cell to respond to oxidate damage. The binding of apoferritin could, either directly or through changes in the oxidative state of the cell, alter the expression of other cellular proteins important for neoplastic transformation, e.g. for the cell to enter crisis, such as the heat shock proteins. Moreover, transcriptional activation and repression of viral genes by E2 expression and DNA replication and has been suggested to be modulated by changes in the intracellular redox environment. Agents which inhibit the interaction could therefore be useful in preventing papillomavirus infection and/or transformation.

In a further embodiment, the cellular E6-binding protein is a nuclear ribonucleoprotein, preferably a C protein of the nuclear ribonucleoprotein particle C, such as provided by ATCC accession number M16342, or alternatively, a ribonucleoprotein E such as represented by the GenBank accession number X12466. The association of E6 with certain proteins involved in RNA processing reactions suggests a role for E6 which includes the direct regulation of expression of cellular and/or viral genes by post-transcriptional control of RNA splicing. Another role of such interactions with E6 could be to provide a nuclear localization signal for the protein by causing sequestration of E6 in the nucleus. Such an interaction is also a potential therapeutic target for inhibitors of papillomavirus infection and/or transformation.

In a still further embodiment, the E6-binding protein is a complement decay accelerating factor, such as represented in GenBank accession number M15799.

In yet another embodiment, the E6-binding protein is a proteasome subunit, such as the proteasome subunit HC8 represented by GenBank accession number D00762. The binding of E6 to proteosome subunits could provide a mechanism for recruiting protease complexes to other complexes involving E6 in order to facilitate/enhance degradation of proteins bound to E6, such as p53.

In addition to the discovery of the interaction of E6 with cellular proteins which have been previously cloned, a number of novel proteins are identified herein as possessing E6-binding capabilities. It is probable that the binding of E6 to these proteins causes, for example, alteration in the cellular function of these proteins and/or alteration in the localization of one or both of the E6 protein and E6-BP. Thus, while all aspects of the normal cellular role of certain of these proteins has not been fully elucidated, the fact that these proteins bind to the viral E6 protein, and that E6 is critical to the pathology of papillomavirus infection, shows that the interaction of each of these proteins with E6 provides potential therapeutic targets for developing agents which are useful in treating, for example, HPV infection.

For instance, in one embodiment, the E6-binding protein includes a protein sequence represented by SEQ ID No: 8 (clone SD-7), e.g. a polypeptide which binds a papillomavirus E6 protein; e.g. the E6-BP comprises one or more calcium binding motifs, e.g. EF hand motifs; e.g. the E6-BP comprises an ER/trans-golgi localization signal, e.g. a carboxy-terminal His-Asp-Glu-Leu (KDEL) sequence.

In another embodiment, the E6-binding protein is a human homolog of a molecular chaperone protein, such as the human chaperonin 10 protein represented by SEQ ID No: 12 (clone SD-22).

In yet further preferred embodiments: the E6-binding protein comprises a polypeptide represented by one of SEQ ID Nos: 9, 10, 11, 13 or 14 (clones SD-8, SD-12, SD-16, SD-28, and SD-32 respectively), e.g. a polypeptide which binds a papillomavirus E6 protein. We have also determined minimal E6-binding motifs, such as from SD-7. As described in the appended examples, the E6-binding motif is both necessary and sufficient to direct E6 binding. Moreover, the minimal E6-binding motif is likely to function antagonistically in vivo.

Another aspect of the invention pertains to an isolated nucleic acid comprising the nucleotide sequence encoding one of the subject E6-binding proteins, and/or equivalents of such nucleic acids. The term nucleic acid as used herein is intended to include fragments and equivalents. The term equivalent is understood to include nucleotide sequences encoding functionally equivalent E6-binding proteins or functionally equivalent peptides which, for example, retain the ability to bind to E6, and which may additionally retain other activities of an E6-BP such as described herein. Equivalent nucleotide sequences will include sequences that differ by one or more nucleotide substitutions, additions or deletions, such as allelic variants; and will, therefore, include sequences that differ from the nucleotide sequence E6-binding proteins shown in any of SEQ ID Nos: 1-7 due to the degeneracy of the genetic code. Equivalents will also include nucleotide sequences that hybridize under stringent conditions (i.e., equivalent to about 20-27° C. below the melting temperature (T_(m)) of the DNA duplex formed in about 1M salt) to the nucleotide sequence of the presently claimed E6-binding proteins represented in SEQ ID Nos: 1-7, or to the nucleotide sequence of an E6-binding protein from the pRS306-E6BP library (ATCC accession No: 75827). In one embodiment, equivalents will further include nucleic acid sequences derived from and evolutionarily related to, a nucleotide sequences shown in any of SEQ ID Nos: 1-7.

Moreover, it will be generally appreciated that, under certain circumstances, it may be advantageous to provide, homologs of the subject E6-binding proteins which function in a limited capacity as one of either an E6-BP agonists or an E6-BP antagonist, in order to promote or inhibit only a subset of the biological activities of the naturally-occurring form of the protein. Thus, specific biological effects can be elicited by treatment with a homolog of limited function, and with fewer side effects relative to treatment with agonists or antagonists which are directed to all of an E6-binding proteins biological activities.

Such homologs of the subject E6-binding proteins can be generated by mutagenesis, such as by discrete point mutation(s) or by truncation. For instance, mutation can give rise to homologs which retain substantially the same, or merely a subset, of the biological activity of the E6-BP from which it was derived. Alternatively, antagonistic forms of the protein can be generated which are able to inhibit the function of the naturally occurring form of the protein, such as by competitively binding to E6.

A protein has E6-BP biological activity if it has one or more of the following properties: the ability to modulate proliferation/cell growth of a eukaryotic cell, e.g. a mammalian cell, e.g. a human cell; the ability to modulate the efficacy of papillomavirus infection, e.g. human papillomaviruses, e.g. infection by HPV-16, HPV-18, HPV-31 or HPV-33; the ability to affect the efficacy of cell transformation, e.g. PV-mediated transformation, e.g. PV-mediated transformation, e.g. high risk HPV-mediated transformation; the ability to affect the efficacy of cellular immortalization, e.g. PV-mediated transformation, e.g. HPV-mediated transformation, e.g. high risk HPV-mediated immortalization; or the ability to bind a PV E6 protein, e.g. an HPV E6 protein, e.g. a high risk HPV E6 protein. A protein also has biological activity if it is a specific agonist or antagonist of one of the above recited properties.

As used herein, the term "nucleic acid" refers to polynucleotides such as deoxyribonucleic acid (DNA), and, where appropriate, ribonucleic acid (RNA). The term should also be understood to include, as equivalents, analogs of either RNA or DNA made from nucleotide analogs, and, as applicable to the embodiment being described, single-stranded (such as sense or antisense) and double-stranded polynucleotides.

As used herein, the term "gene" or "recombinant gene" refers to a nucleic acid comprising an open reading frame encoding an E6-binding protein of the present invention, including both exon and (optionally) intron sequences. A "recombinant gene" refers to nucleic acid encoding an E6-binding protein and comprising E6-BP encoding exon sequences, though it may optionally include intron sequences which are either derived from a chromosomal E6-BP gene or from an unrelated chromosomal gene. Exemplary recombinant genes encoding the subject E6-binding proteins are represented by any one of SEQ ID Nos: 1-7. Moreover, recombinant genes encoding each of the subject E6-binding proteins can be isolated from ATCC deposit No: 75827, as described below. The term "intron" refers to a DNA sequence present in a given E6-BP gene which is not translated into protein and is generally found between exons.

As used herein, the term "transfection" means the introduction of a nucleic acid, e.g., an expression vector, into a recipient cell by nucleic acid-mediated gene transfer. "Transformation", as used herein, refers to a process in which a cell's genotype is changed as a result of the cellular uptake of exogenous DNA or RNA, and, for example, the transformed cell expresses a recombinant form of the E6-binding protein of the present invention or where anti-sense expression occurs from the transferred gene, the expression of a naturally-occurring form of the E6-binding protein is disrupted.

As used herein, the term "vector" refers to a nucleic acid molecule capable of transporting another nucleic acid to which it has been linked. One type of preferred vector is an episome, i.e., a nucleic acid capable of extra-chromosomal replication. Preferred vectors are those capable of autonomous replication and/expression of nucleic acids to which they are linked. Vectors capable of directing the expression of genes to which they are operatively linked are referred to herein as "expression vectors". In general, expression vectors of utility in recombinant DNA techniques are often in the form of "plasmids" which refer to circular double stranded DNA loops which, in their vector form are not bound to the chromosome. In the present specification, "plasmid" and "vector" are used interchangeably as the plasmid is the most commonly used form of vector. However, the invention is intended to include such other forms of expression vectors which serve equivalent functions and which become known in the art subsequently hereto.

"Transcriptional regulatory sequence" is a generic term used throughout the specification to refer to DNA sequences, such as initiation signals, enhancers, and promoters, which induce or control transcription of protein coding sequences with which they are operably linked. In preferred embodiments, transcription of a recombinant E6-BP gene is under the control of a promoter sequence (or other transcriptional regulatory sequence) which controls the expression of the recombinant gene in a cell-type in which expression is intended. It will also be understood that the recombinant gene can be under the control of transcriptional regulatory sequences which are the same or which are different from those sequences which control transcription of the naturally-occurring form of the E6-binding protein.

As used herein, the term "tissue-specific promoter" means a DNA sequence that serves as a promoter, i.e., regulates expression of a selected DNA sequence operably linked to the promoter, and which effects expression of the selected DNA sequence in specific cells of a tissue, such as cells of a epithelial lineage, e.g. cervical squamous cells. In the illustrative embodiment of epithelial-specific promoters, gene constructs can be used as a part of gene therapy to deliver, for example, an E6-BP antagonist in order to modulate levels of E6/E6-BP complexes comprising one of the subject E6-binding proteins in papillomavirus-mediated disorders, e.g. papillomas, or to direct expression of an antisense construct of one of the subject E6-binding proteins in only epithelial tissue. The term also covers so-called "leaky" promoters, which regulate expression of a selected DNA primarily in one tissue, but cause expression in other tissues as well.

As used herein, a "transgenic animal" is any animal, preferably a non-human mammal, e.g. a rat, a mouse or pig, in which one or more of the cells of the animal includes a transgene. The transgene is introduced into the cell, directly or indirectly by introduction into a precursor of the cell, by way of deliberate genetic manipulation, such as by microinjection or by infection with a recombinant virus. The term genetic manipulation does not include classical cross-breeding, or in vitro fertilization, but rather is directed to the introduction of a recombinant DNA molecule. This molecule may be integrated within a chromosome, or it may be extrachromosomally replicating DNA. In the transgenic animals described herein, the transgene causes cells to express a recombinant form of one or more of the subject E6-binding proteins, or alternatively, to disrupt expression of one or more of the naturally-occurring forms of the E6-BP genes. However, transgenic animals in which a recombinant E6-BP gene is silent are also contemplated, as for example, the FLP or CRE recombinase dependent constructs described below. The "non-human animals" of the invention include vertebrates such as rodents, non-human primates, sheep, dog, cow, chickens, amphibians, reptiles, etc. Preferred non-human animals are selected from the rodent family including rat and mouse, most preferably mouse, though transgenic amphibians, such as members of the Xenopus genus, and transgenic chickens can also provide important tools for understanding, for example, embryogenesis and tissue patterning. The term "chimeric animal" is used herein to refer to animals in which the recombinant gene is found, or in which the recombinant is expressed in some but not all cells of the animal. The term "tissue-specific chimeric animal" indicates that the recombinant E6-BP gene is present and/or expressed in some tissues but not others.

As used herein, the term "transgene" means a nucleic acid sequence (encoding, e.g., one or more E6-binding proteins), which is partly or entirely heterologous, i.e., foreign, to the transgenic animal or cell into which it is introduced, or, is homologous to an endogenous gene of the transgenic animal or cell into which it is introduced, but which is designed to be inserted, or is inserted, into the animal's genome in such a way as to alter the genome of the cell into which it is inserted (e.g., it is inserted at a location which differs from that of the natural gene or its insertion results in a knockout). A transgene can include one or more transcriptional regulatory sequences and any other nucleic acid, such as introns, that may be necessary for optimal expression of a selected nucleic acid.

As is well known, genes for a particular polypeptide may exist in single or multiple copies within the genome of an individual. Such duplicate genes may be identical or may have certain modifications, including nucleotide substitutions, additions or deletions, which all still code for polypeptides having substantially the same activity. The term "DNA sequence encoding an E6-binding protein" may thus refer to one or more genes within a particular individual. Moreover, certain differences in nucleotide sequences may exist between individual organisms, which are called alleles. Such allelic differences may or may not result in differences in amino acid sequence of the encoded polypeptide yet still encode a protein with the same biological activity.

"Homology" refers to sequence similarity between two peptides or between two nucleic acid molecules. Homology can be determined by comparing a position in each sequence which may be aligned for purposes of comparison. When a position in the compared sequence is occupied by the same base or amino acid, then the molecules are homologous at that position. A degree of homology between sequences is a function of the number of matching or homologous positions shared by the sequences.

"Cells," "host cells" or "recombinant host cells" are terms used interchangeably herein It is understood that such terms refer not only to the particular subject cell but to the progeny or potential progeny of such a cell. Because certain modifications may occur in succeeding generations due to either mutation or environmental influences, such progeny may not, in fact, be identical to the parent cell, but are still included within the scope of the term as used herein.

A "chimeric protein" or "fusion protein" is a fusion of a first amino acid sequence encoding one of the subject E6-binding proteins with a second amino acid sequence defining a domain foreign to and not substantially homologous with any domain of the subject E6-BP. A chimeric protein may present a foreign domain which is found (albeit in a different protein) in an organism which also expresses the first protein, or it may be an "interspecies", "intergeneric", etc. fusion of protein structures expressed by different kinds of organisms.

The term "evolutionarily related to", with respect to nucleic acid sequences encoding E6-binding proteins, refers to nucleic acid sequences which have arisen naturally in an organism, including naturally occurring mutants. The term also refers to nucleic acid sequences which, while derived from a naturally occurring E6-BP, have been altered by mutagenesis, as for example, combinatorial mutagenesis described below, yet still encode polypeptides which have at least one activity of an E6-binding protein.

The term "isolated" as also used herein with respect to nucleic acids, such as DNA or RNA, refers to molecules separated from other DNAs, or RNAs, respectively, that are present in the natural source of the macromolecule. For example, isolated nucleic acids encoding the subject E6-binding proteins preferably include no more than 10 kilobases (kb) of nucleic acid sequence which naturally immediately flanks a particular E6-BP gene in genomic DNA, more preferably no more than 5 kb of such naturally occurring flanking sequences, and most preferably less than 1.5 kb of such naturally occurring flanking sequence. The term isolated as used herein also refers to a nucleic acid or peptide that is substantially free of cellular material, viral material, or culture medium when produced by recombinant DNA techniques, or chemical precursors or other chemicals when chemically synthesized. Moreover, an "isolated nucleic acid" is meant to include nucleic acid fragments which are not naturally occurring as fragments and would not be found in the natural state.

In one embodiment, the nucleic acid is a cDNA encoding a peptide having at least one activity of a subject E6-binding protein. Preferably, the nucleic acid is a cDNA molecule comprising at least a portion of the nucleotide sequence represented in one of SEQ ID Nos: 1-7. A preferred portion of these cDNA molecules includes the coding region of the gene.

Preferred nucleic acids encode an E6-binding protein comprising an amino acid sequence at least 60% homologous, more preferably 70% homologous and most preferably 80%, 90%, or 95% homologous with an amino acid sequence shown in one of SEQ ID Nos: 8-14. Nucleic acids which encode polypeptides having an activity of a subject E6-binding protein and having at least about 90%, more preferably at least about 95%, and most preferably at least about 98-99% homology with a sequence shown in one of SEQ ID Nos: 8-14 are also within the scope of the invention.

Certain of the nucleotide sequences shown in the appended sequence listing encode portions of the subject E6-binding proteins. Therefore, in a further embodiment of the invention, the recombinant E6-BP genes can include, in addition to nucleotides encoding the amino acid sequences shown in SEQ. ID. Nos: 1-7, additional nucleotide sequences which encode amino acids at the C-terminus and N-terminus of each protein. For instance, a recombinant E6-BP gene can include nucleotide sequences of a PCR fragment generated by amplifying one of the coding sequences for one of the E6-BP clones of ATCC deposit No: 75827 using sets of primers derived from Table 1 below.

Another aspect of the invention provides a nucleic acid which hybridizes under high or low stringency conditions to a nucleic acid which encodes a peptide having all or a portion of an amino acid sequence shown in SEQ ID No: 8, SEQ ID No: 9, SEQ ID No: 10, SEQ ID No: 11, SEQ ID No: 12, SEQ ID No: 13 or SEQ ID No: 14. Appropriate stringency conditions which promote DNA hybridization, for example, 6.0× sodium chloride/sodium citrate (SSC) at about 45° C., followed by a wash of 2.0× SSC at 50° C., are known to those skilled in the art or can be found in Current Protocols in Molecular Biology, John Wiley & Sons, N.Y. (1989), 6.3.1-6.3.6. For example, the salt concentration in the wash step can be selected from a low stringency of about 2.0× SSC at 50° C. to a high stringency of about 0.2× SSC at 50° C. In addition, the temperature in the wash step can be increased from low stringency conditions at room temperature, about 22° C., to high stringency conditions at about 65° C.

Nucleic acids, having a sequence that differs from the nucleotide sequence shown any of SEQ ID Nos: 1-7 due to degeneracy in the genetic code are also within the scope of the invention. Such nucleic acids encode functionally equivalent peptides (i.e., a peptide having a biological activity of an E6-binding protein) but differ in sequence from the sequence shown in said sequence listings due to degeneracy in the genetic code. For example, a number of amino acids are designated by more than one triplet. Codons that specify the same amino acid, or synonyms (for example, CAU and CAC each encode histidine) may result in "silent" mutations which do not affect the amino acid sequence of the E6-binding protein. However, it is expected that DNA sequence polymorphisms that do lead to changes in the amino acid sequences of the subject E6-binding proteins will exist among vertebrates. One skilled in the art will appreciate that these variations in one or more nucleotides (up to about 3-5% of the nucleotides) of the nucleic acids encoding polypeptides having an activity of an E6-binding protein may exist among individuals of a given species due to natural allelic variation. Any and all such nucleotide variations and resulting amino acid polymorphisms are within the scope of this invention.

Fragments of the nucleic acids encoding the active portion of the presently claimed E6-binding proteins are also within the scope of the invention. As used herein, a fragment of the nucleic acid encoding the active portion of an E6-binding protein refers to a nucleic acid having fewer nucleotides than the nucleotide sequence encoding the entire amino acid sequence of an E6-binding protein but which nevertheless encodes a peptide having an E6-BP biological activity, e.g.) an agonist activity of an E6-binding protein. Nucleic acid fragments within the scope of the present invention include those capable of hybridizing under high or low stringency conditions with nucleic acids from other species for use in screening protocols to detect E6-BP homologs, as well as those capable of hybridizing with nucleic acids from human specimens for use in detecting the presence of a nucleic acid encoding one of the subject E6-BPs, including alternate isoforms, e.g. mRNA splicing variants. Nucleic acids within the scope of the invention may also contain linker sequences, modified restriction endonuclease sites and other sequences useful for molecular cloning, expression or purification of recombinant forms of the subject E6-binding proteins.

As used herein, an "E6-binding motif" refers to a polypeptide sequence which confers a binding activity for specifically interacting with a papillomavirus E6 protein. An exemplary E6-binding motif is represented by Ala194-Asp218 of SEQ ID No. 8.

As indicated by the examples set out below, a nucleic acid encoding a peptide having an activity of an E6-binding protein may be obtained from mRNA present in any of a number of eukaryotic cells. It should also be possible to obtain nucleic acids encoding E6-binding proteins of the present invention from genomic DNA obtained from both adults and embryos. For example, a gene encoding an E6-binding protein can be cloned from either a cDNA or a genomic library in accordance with protocols herein described, as well as those generally known to persons skilled in the art. A cDNA encoding one of the subject E6-binding proteins can be obtained by isolating total mRNA from a cell, e.g. a mammalian cell, e.g. a human cell, including tumor cells. Double stranded cDNAs can then be prepared from the total mRNA, and subsequently inserted into a suitable plasmid or bacteriophage vector using any one of a number of known techniques. The gene encoding the E6-binding protein can also be cloned using established polymerase chain reaction techniques in accordance with the nucleotide sequence information provided by the invention. The nucleic acid of the invention can be DNA or RNA. A preferred nucleic acid is a cDNA represented by the sequence shown in SEQ ID No: 1; e.g. provided that the nucleic acid hybridizes to a nucleic acid probe comprising a subsequence preferably encoding at least four consecutive amino acid residues between residues 1 and 133 of SEQ ID No: 8, more preferably at least ten consecutive amino acid residues, more preferably at least twenty consecutive amino acid residues. In yet another preferred embodiment, the nucleic acid will include a nucleotide sequence that hybridizes to a nucleotide sequence encoding an E6-binding motif. Another nucleic acid is a cDNA represented by the sequence shown in SEQ ID No: 2. Other preferred nucleic acids include cDNA molecules represented by the sequences shown in one of SEQ ID Nos: 3-7. A preferred nucleic acid is a cDNA derived from the pRS306-E6BP library (ATCC deposit No: 75827).

Another aspect of the invention relates to the use of the isolated nucleic acid in "antisense" therapy. As used herein, "antisense" therapy refers to administration or in situ generation of oligonucleotide probes or their derivatives which specifically hybridizes (e.g. binds) under cellular conditions, with the cellular mRNA and/or genomic DNA encoding an E6-binding protein so as to inhibit expression of that protein, e.g. by inhibiting transcription and/or translation. The binding may be by conventional base pair complementarity, or, for example, in the case of binding to DNA duplexes, through specific interactions in the major groove of the double helix. In general, "antisense" therapy refers to the range of techniques generally employed in the art, and includes any therapy which relies on specific binding to oligonucleotide sequences.

An antisense construct of the present invention can be delivered, for example, as an expression plasmid which, when transcribed in the cell, produces RNA which is complementary to at least a unique portion of the cellular mRNA which encodes an E6-binding protein. Alternatively, the antisense construct is an oligonucleotide probe which is generated ex vivo and which, when introduced into the cell causes inhibition of expression by hybridizing with the mRNA and/or genomic sequences encoding an E6-binding protein. Such oligonucleotide probes are preferably modified oligonucleotide which are resistant to endogenous nucleases, e.g. exonucleases and/or endonucleases, and is therefore stable in vivo. Exemplary nucleic acid molecules for use as antisense oligonucleotides are phosphoramidate, phosphothioate and methylphosphonate analogs of DNA (see also U.S. Pat. Nos. 5,176,996; 5,264,564; and 5,256,775). Additionally, general approaches to constructing oligomers useful in antisense therapy have been reviewed, for example, by van der Krol et al. (1988) Biotechniques 6:958-976; and Stein et al. (1988) Cancer Res 48:2659-2668.

Accordingly, the modified oligomers of the invention are useful in therapeutic, diagnostic, and research contexts. In therapeutic applications, the oligomers are utilized in a manner appropriate for antisense therapy in general. For such therapy, the oligomers of the invention can be formulated for a variety of modes of administration, including systemic and topical or localized administration. Techniques and formulations generally may be found in Remmington's Pharmaceutical Sciences, Meade Publishing Co., Easton, Pa. For systemic administration, injection is preferred, including intramuscular, intravenous, intraperitoneal, and subcutaneous for injection, the oligomers of the invention can be formulated in liquid solutions, preferably in physiologically compatible buffers such as Hank's solution or Ringer's solution. In addition, the oligomers may be formulated in solid form and redissolved or suspended immediately prior to use. Lyophilized forms are also included.

Systemic administration can also be by transmucosal or transdermal means, or the compounds can be administered orally. For transmucosal or transdermal administration, penetrants appropriate to the barrier to be permeated are used in the formulation. Such penetrants are generally known in the art, and include, for example, for transmucosal administration bile salts and fusidic acid derivatives. In addition, detergents may be used to facilitate permeation. Transmucosal administration may be through nasal sprays or using suppositories. For oral administration, the oligomers are formulated into conventional oral administration forms such as capsules, tablets, and tonics. For topical administration, the oligomers of the invention are formulated into ointments, salves, gels, or creams as generally known in the art.

In addition to use in therapy, the oligomers of the invention may be used as diagnostic reagents to detect the presence or absence of the target DNA or RNA sequences to which they specifically bind.

The present invention also provides nucleic acid encoding only a portion of an E6-binding protein, such as the E6-binding motif. As used herein, a fragment of a nucleic acid encoding such a portion of an E6-binding protein refers to a nucleotide sequence having fewer nucleotides than the nucleotide sequence encoding the entire amino acid sequence of a full-length E6-binding protein, yet which still includes enough of the coding sequence so as to encode a polypeptide which is capable of binding to an E6 protein. Moreover, nucleic acid fragments within the scope of the invention include those fragments capable of hybridizing under high or low stringency conditions with nucleic acids from other vertebrate species, particularly other mammals, and can be used in screening protocols to detect homologs, of the subject E6-binding proteins. Nucleic acids within the scope of the invention may also contain linker sequences, modified restriction endonuclease sites and other sequences useful for molecular cloning, expression or purification of recombinant peptides derived from E6-binding proteins.

This invention also provides expression vectors containing a nucleic acid encoding a peptide having an activity of an E6-binding protein, operably linked to at least one transcriptional regulatory sequence. Operably linked is intended to mean that the nucleotide sequence is linked to a regulatory sequence in a manner which allows expression of the nucleotide sequence. Regulatory sequences are art-recognized and are selected to direct expression of the peptide having an activity of an E6-binding protein. Accordingly, the term transcriptional regulatory sequence includes promoters, enhancers and other expression control elements. Exemplary regulatory sequences are described in Goeddel; Gene Expression Technology: Methods in Enzymology 185, Academic Press, San Diego, Calif. (1990). For instance, any of a wide variety of expression control sequences-sequences that control the expression of a DNA sequence when operatively linked to it may be used in these vectors to express DNA sequences encoding the E6-binding proteins of this invention. Such useful expression control sequences, include, for example, the early and late promoters of SV40, adenovirus or cytomegalovirus immediate early promoter, the lac system, the trp system, the TAC or TRC system, T7 promoter whose expression is directed by T7 RNA polymerase, the major operator and promoter regions of phage lambda, the control regions for fd coat protein, the promoter for 3-phosphoglycerate kinase or other glycolytic enzymes, the promoters of acid phosphatase, e.g., Pho5, the promoters of the yeast α-mating factors, the polyhedron promoter of the baculovirus system and other sequences known to control the expression of genes of prokaryotic or eukaryotic cells or their viruses, and various combinations thereof. It should be understood that the design of the expression vector may depend on such factors as the choice of the host cell to be transformed and/or the type of protein desired to be expressed. In one embodiment, the expression vector includes a recombinant gene encoding a peptide having an activity of a subject E6-binding protein, or alternatively, encoding a peptide which is an antagonistic form of the subject E6-binding protein. Such expression vectors can be used to transfect cells and thereby produce proteins or peptides, including fusion proteins or peptides, encoded by nucleic acids as described herein.

Another aspect of the present invention concerns recombinant forms of the subject E6-binding proteins which are encoded by genes derived from eukaryotic organisms, e.g. mammals, e.g. humans, and which have at least one biological activity of an E6-binding protein, e.g., which is an antagonist of at least one activity of an E6-BP of the present invention, including naturally occurring dysfunctional mutants. The term "recombinant protein" refers to a protein of the present invention which is produced by recombinant DNA techniques, wherein generally DNA encoding the subject E6-binding protein is inserted into a suitable expression vector which is in turn used to transform a host cell to produce the heterologous protein. Moreover, the phrase "derived from", with respect to a recombinant gene encoding the recombinant E6-BP, is meant to include within the meaning of "recombinant protein" those proteins having an amino acid sequence of a native E6-BP of the present invention, or an amino acid sequence similar thereto which is generated by mutations including substitutions and deletions (including truncation) of a naturally occurring E6-binding protein of a organism. Recombinant proteins preferred by the present invention, in addition to native E6-binding proteins, are at least 60% homologous, more preferably 70% homologous and most preferably 80% homologous with an amino acid sequence shown in one of SEQ ID Nos: 8-14. Polypeptides having an activity of the subject E6-binding proteins (i.e. either agonistic or antagonistic) and having at least about 90%, more preferably at least about 95%, and most preferably at least about 98-99% homology with a sequence of either in SEQ ID No: 8-14 are also within the scope of the invention.

The present invention further pertains to recombinant forms of the subject E6-binding proteins which are encoded by genes derived from a organism and which have amino acid sequences evolutionarily related to an E6-binding protein of either SEQ ID No: 8-14. Such recombinant E6-binding proteins preferably are capable of functioning in one of either role of an agonist or antagonist of at least one biological activity of the present E6-BPs. The term "evolutionarily related to", with respect to amino acid sequences of the present recombinant E6-binding proteins, refers to E6-binding proteins having amino acid sequences which have arisen naturally, as well as mutational variants of E6-binding proteins which are derived, for example, by combinatorial mutagenesis. Such evolutionarily derived E6-binding protein preferred by the present invention are at least 60% homologous, more preferably 70% homologous and most preferably 80% homologous with an amino acid sequence shown in either SEQ ID No: 8, SEQ ID No: 9, SEQ ID No: 10, SEQ ID No: 11, SEQ ID No: 12, SEQ ID No: 13 or SEQ ID No: 14. Polypeptides having at least about 90%, more preferably at least about 95%, and most preferably at least about 98-99% homology with a sequence shown in any of SEQ ID Nos: 8-14 are also within the scope of the invention.

A salient feature of the present invention is the identification of the E6 binding motif of the E6-BP^(SD7) protein. For instance, the present invention provides portions of the SD-7 protein which may be easier to manipulate than the full length protein. As described in the appended examples, the present invention provides polypeptides which include a portions of the SD-7 protein which retain the ability to bind to the E6 protein. Such E6-binding motifs can include an amino acid sequence corresponding to Ala194-Asp218 of SEQ ID No. 8.

Accordingly, the present invention provides polypeptides comprising an E6-binding motif of an SD-7 protein, which SD-7 portion of the polypeptide may be represented by the general formula X-Y-Z, wherein Y represents an amino acid sequence of an E6-binding motif within residues 194 to 218 of SEQ ID No. 8, X is absent, or represents an amino acid sequence, e.g., a sequence between 1-194, 1-100, 1-75, 1-50 or 1-25 residues in length, e.g., all or a portion of the amino acid sequence between residues 1 and 194 of SEQ ID No. 8 and (optionally) immediately N-terminal to Y, and Z is absent, or represents an amino acid sequence, e.g., a sequence between 1-99, 1-75, 1-50 or 1-25 residues in length, e.g., all or a portion of the amino acid sequence between residues 218 and 317 of SEQ ID No. 8 and (optionally) immediately C-terminal to Y. Preferably, the polypeptide includes only about 25 to 200 residues of SD-7 polypeptide sequence, though more preferably includes only about 25, 50, 75 or 100 amino acid residues. In illustrative embodiments, the polypeptide used to generate the subject assay includes: an SD-7 polypeptide sequence corresponding to Ala194 through about Asp218; an SD-7 polypeptide sequence corresponding to Met99 through about Leu317; an SD-7 polypeptide sequence corresponding to Val107 through about Asp218; an SD-7 polypeptide sequence corresponding to Ala194 through about Glu316.

The present invention further pertains to methods of producing the subject E6-binding proteins. For example, a host cell transfected with a nucleic acid vector directing expression of a nucleotide sequence encoding the subject E6-binding protein can be cultured under appropriate conditions to allow expression of the peptide to occur. The peptide may be secreted and isolated from a mixture of cells and medium containing the recombinant E6-BP. Alternatively, the peptide may be retained cytoplasmically and the cells harvested, lysed and the protein isolated. A cell culture includes host cells, media and other byproducts. Suitable media for cell culture are well known in the art. The recombinant E6-BP peptide can be isolated from cell culture medium, host cells, or both using techniques known in the art for purifying proteins including ion-exchange chromatography, gel filtration chromatography, ultrafiltration, electrophoresis, and immunoaffinity purification with antibodies specific for such peptide. In a preferred embodiment, the recombinant E6-binding protein is a fusion protein containing a domain which facilitates its purification, such as an E6-BP-GST fusion protein.

This invention also pertains to a host cell transfected to express a recombinant form of at least one of the subject E6-binding proteins. The host cell may be any prokaryotic or eukaryotic cell. Thus, a nucleotide sequence derived from the cloning of the E6-binding proteins of the present invention, encoding all or a selected portion of a protein, can be used to produce a recombinant form of an E6-BP via microbial or eukaryotic cellular processes. Ligating the polynucleotide sequence into a gene construct, such as an expression vector, and transforming or transfecting into hosts, either eukaryotic (yeast, avian, insect or mammalian) or prokaryotic (bacterial cells), are standard procedures used in producing other well-known proteins, e.g. p53, E6, E6-AP, and the like. Similar procedures, or modifications thereof, can be employed to prepare recombinant E6-binding proteins, or portions thereof, by microbial means or tissue-culture technology in accord with the subject invention.

The recombinant E6-binding gene can be produced by ligating nucleic acid encoding a subject E6-binding protein, or a portion thereof, into a vector suitable for expression in either prokaryotic cells, eukaryotic cells, or both. Expression vectors for production of recombinant forms of the subject E6-binding proteins include plasmids and other vectors. For instance, suitable vectors for the expression of an E6-BP include plasmids of the types: pBR322-derived plasmids, pEMBL-derived plasmids, pEX-derived plasmids, pBTac-derived plasmids and pUC-derived plasmids for expression in prokaryotic cells, such as E. coli.

A number of vectors exist for the expression of recombinant proteins in yeast. For instance, YEP24, YIP5, YEP51, YEP52, pYES2, and YRP17 are cloning and expression vehicles useful in the introduction of genetic constructs into S. cerevisiae (see, for example, Broach et al. (1983) in Experimental Manipulation of Gene Expression, ed. M. Inouye Academic Press, p. 83, incorporated by reference herein). These vectors can replicate in E. coli due the presence of the pBR322 ori, and in S. cerevisiae due to the replication determinant of the yeast 2 micron plasmid. In addition, drug resistance markers such as ampicillin can be used. In an illustrative embodiment, an E6-binding protein is produced recombinantly utilizing an expression vector generated by sub-cloning a gene encoding the protein from pRS306-E6BP library (ATCC accession No: 75827) using, for example, primers based on SEQ ID No: 1-7 and/or primers based on the flanking plasmid sequence (e.g. the primers represented by SEQ ID Nos: 15-17).

The preferred mammalian expression vectors contain both prokaryotic sequences to facilitate the propagation of the vector in bacteria, and one or more eukaryotic transcription units that are expressed in eukaryotic cells. The pcDNAI/amp, pcDNAI/neo, pRc/CMV, pSV2gpt, pSV2neo, pSV2-dhfr, pTk2, pRSVneo, pMSG, pSVT7, pko-neo and pHyg derived vectors are examples of mammalian expression vectors suitable for transfection of eukaryotic cells. Some of these vectors are modified with sequences from bacterial plasmids, such as pBR322, to facilitate replication and drug resistance selection in both prokaryotic and eukaryotic cells. Alternatively, derivatives of viruses such as the bovine papilloma virus (BPV-1), or Epstein-Barr virus (pHEBo, pREP-derived and p205) can be used for transient expression of proteins in eukaryotic cells. The various methods employed in the preparation of the plasmids and transformation of host organisms are well known in the art. For other suitable expression systems for both prokaryotic and eukaryotic cells, as well as general recombinant procedures, see Molecular Cloning A Laboratory Manual, 2nd Ed., ed. by Sambrook, Fritsch and Maniatis (Cold Spring Harbor Laboratory Press: 1989) Chapters 16 and 17. In some instances, it may be desirable to express the recombinant E6-BP by the use of a baculovirus expression system. Examples of such baculovirus expression systems include pVL-derived vectors (such as pVL1392, pVL1393 and pVL941), pAcUW-derived vectors (such as pAcUW1), and pBlueBac-derived vectors (such as the β-gal containing pBlueBac III).

When expression of a portion of one of the subject E6-binding protein is desired, i.e. a trunction mutant, it may be necessary to add a start codon (ATG) to the oligonucleotide fragment containing the desired sequence to be expressed. It is well known in the art that a methionine at the N-terminal position can be enzymatically cleaved by the use of the enzyme methionine aminopeptidase (MAP). MAP has been cloned from E. coli (Ben-Bassat et al. (1987) J. Bacteriol. 169:751-757) and Salmonella typhimurium and its in vitro activity has been demonstrated on recombinant proteins (Miller et al. (1987) PNAS 84:2718-1722). Therefore, removal of an N-terminal methionine, if desired, can be achieved either in vivo by expressing E6-BP-derived polypeptides in a host which produces MAP (e.g., E. coli or CM89 or S. cerevisiae), or in vitro by use of purified MAP (e.g., procedure of Miller et al., supra).

Alternatively, the coding sequences for the polypeptide can be incorporated as a part of a fusion gene including a nucleotide sequence encoding a different polypeptide. This type of expression system can be useful under conditions where it is desirable to produce an immunogenic fragment of an E6-binding protein. For example, the VP6 capsid protein of rotavirus can be used as an immunologic carrier protein for portions of the E6-BP polypeptide, either in the monomeric form or in the form of a viral particle. The nucleic acid sequences corresponding to the portion of a subject E6-binding protein to which antibodies are to be raised can be incorporated into a fusion gene construct which includes coding sequences for a late vaccinia virus structural protein to produce a set of recombinant viruses expressing fusion proteins comprising a portion of the protein E6-BP as part of the virion. It has been demonstrated with the use of immunogenic fusion proteins utilizing the Hepatitis B surface antigen fusion proteins that recombinant Hepatitis B virions can be utilized in this role as well. Similarly, chimeric constructs coding for fusion proteins containing a portion of an E6-binding protein and the poliovirus capsid protein can be created to enhance immunogenicity of the set of polypeptide antigens (see, for example, EP Publication No: 0259149; and Evans et al. (1989) Nature 339:385; Huang et al. (1988) J. Virol. 62:3855; and Schlienger et al. (1992) J. Virol. 66:2).

The Multiple Antigen Peptide system for peptide-based immunization can also be utilized to generate an immunogen, wherein a desired portion of a subject E6-binding protein is obtained directly from organo-chemical synthesis of the peptide onto an oligomeric branching lysine core (see, for example, Posnett et al. (1988) JBC 263:1719 and Nardelli et al. (1992) J. Immunol. 148:914). Antigenic determinants of the subject E6-binding proteins can also be expressed and presented by bacterial cells.

In addition to utilizing fusion proteins to enhance immunogenicity, it is widely appreciated that fusion proteins can also facilitate the expression of proteins, such as any one of the E6-binding protein of the present invention. For example, as described below, an E6-binding protein of the present invention can be generated as a glutathione-S-transferase (GST-fusion protein). Such GST fusion proteins can enable easy purification of the E6-binding protein, such as by the use of glutathione-derivatized matrices (see, for example, Current Protocols in Molecular Biology, eds. Ausubel et al. (N.Y.: John Wiley & Sons, 1991)).

In another embodiment, a fusion gene coding for a purification leader sequence, such as a poly-(His)/enterokinase cleavage site sequence at the N-terminus of the desired portion of the E6-binding protein, can allow purification of the poly(His)- expressed E6-BP-fusion protein by affinity chromatography using a Ni²⁺ metal resin. The purification leader sequence can then be subsequently removed by treatment with enterokinase (e.g., see Hochuli et al. (1987) J. Chromatography 411:177; and Janknecht et al. PNAS 88:8972).

Techniques for making fusion genes are known to those skilled in the art. Essentially, the joining of various DNA fragments coding for different polypeptide sequences is performed in accordance with conventional techniques, employing blunt-ended or stagger-ended termini for ligation, restriction enzyme digestion to provide for appropriate termini, filling-in of cohesive ends as appropriate, alkaline phosphatase treatment to avoid undesirable joining, and enzymatic ligation. In another embodiment, the fusion gene can be synthesized by conventional techniques including automated DNA synthesizers. Alternatively, PCR amplification of gene fragments can be carried out using anchor primers which give rise to complementary overhangs between two consecutive gene fragments which can subsequently be annealed to generate a chimeric gene sequence (see, for example, Current Protocols in Molecular Biology, eds. Ausubel et al. John Wiley & Sons: 1992).

Another aspect of the invention pertains to isolated peptides having an activity of one of the subject E6-binding protein, or which are antagonists of at least one of the biological activities of the subject E6-BPs. In preferred embodiments, a biological activity of an E6-binding protein includes: an ability to modulate proliferation/cell growth of a eukaryotic cell, e.g. a mammalian cell, e.g. a human cell; an ability to affect the efficacy of papillomavirus infection, e.g. infection by HPV-16, HPV-18, HPV-31 or HPV-33; an ability to affect the efficacy of cell transformation, e.g. PV-mediated transformation, e.g. PV-mediated transformation, e.g. high risk HPV-mediated transformation; an ability to affect the efficacy of cellular immortalization, e.g. PV-mediated transformation, e.g. HPV-mediated transformation, e.g. high risk HPV-mediated immortalization; an ability to bind a PV E6 protein, e.g. an HPV E6 protein, e.g. a high risk HPV E6 protein. E6-binding proteins of the present invention, particularly those with antagonistic activity, can have the ability to suppress tumor growth, e.g. in a tumor cell in which endogenous E6-BP is mis-expressed. Other biological activities of the subject E6-binding proteins are described herein or will be reasonably apparent to those skilled in the art. A polypeptide having at least one biological activity of the subject E6-binding proteins may differ in amino acid sequence from the sequence shown in either SEQ ID No: 8-14, but such differences result in a modified protein which functions in the same or similar manner as the native E6-binding protein or which has the same or similar characteristics of the native E6-binding protein. Various modifications of a E6-binding protein of the present invention to produce these and other functionally equivalent peptides are described in detail herein. The term peptide, as used herein, refers to peptides, proteins, and polypeptides.

The present invention also makes available isolated E6-binding proteins which are isolated from, or otherwise substantially free of other cellular or viral proteins, especially papillomavirus proteins, normally associated with the E6-binding protein. The term "substantially free of other cellular or viral proteins" (also referred to herein as "contaminating proteins") or "substantially pure or purified preparations" are defined as encompassing E6-BP preparations having less than 20% (by dry weight) contaminating protein, and preferably having less than 5% contaminating protein. Functional forms of the subject E6-binding proteins can be prepared, for the first time, as purified preparations by using a cloned gene as described herein. By "purified", it is meant, when referring to a peptide or DNA or RNA sequence, that the indicated molecule is present in the substantial absence of other biological macromolecules, such as other proteins (particularly viral proteins such as E6, as well as other contaminating proteins). The term "purified" as used herein preferably means at least 80% by dry weight, more preferably in the range of 95-99% by weight, and most preferably at least 99.8% by weight, of biological macromolecules of the same type present (but water, buffers, and other small molecules, especially molecules having a molecular weight of less than 5000, can be present). The term "pure" as used herein preferably has the same numerical limits as "purified" immediately above. "Isolated" and "purified" do not encompass either natural materials in their native state or natural materials that have been separated into components (e.g., in an acrylamide gel) but not obtained either as pure (e.g. lacking contaminating proteins, or chromatography reagents such as denaturing agents and polymers, e.g. acrylamide or agarose) substances or solutions.

However, the subject polypeptides can also be provided in pharmaceutically acceptable carriers and formulated for a variety of modes of administration, including systemic and topical or localized administration. Techniques and formulations generally may be found in Remmington's Pharmaceutical Sciences, Meade Publishing Co., Easton, Pa. In an exemplary embodiment, the E6-binding protein (which includes bioactive fragments such as antagonists) is provided for transmucosal or transdermal delivery. For such administration, penetrants appropriate to the barrier to be permeated are used in the formulation with the polypeptide. Such penetrants are generally known in the art, and include, for example, for transmucosal administration bile salts and fusidic acid derivatives. In addition, detergents may be used to facilitate permeation. Transmucosal administration may be through nasal sprays or using suppositories. For topical administration, the oligomers of the invention are formulated into ointments, salves, gels, or creams as generally known in the art.

In one embodiment, a purified preparation of an antagonistic form of an SD-7 derived polypeptide, such as a fragment as described herein including the E6-binding motif, can be provided in a pharmaceutical preparation suitable for topical administration to epithelial tissue infected and/or transformed by a papillomavirus.

Another aspect of the invention relates to polypeptides derived from full-length E6-binding proteins. Isolated peptidyl portions of the subject E6-binding proteins, such as the E6-binding motif, can be obtained by screening peptides recombinantly produced from the corresponding fragment of the nucleic acid encoding such peptides. In addition, fragments can be chemically synthesized using techniques known in the art such as conventional Merrifield solid phase f-Moc or t-Boc chemistry. For example, an E6-binding protein of the present invention may be arbitrarily divided into fragments of desired length with no overlap of the fragments, or preferably divided into overlapping fragments of a desired length. The fragments can be produced (recombinantly or by chemical synthesis) and tested to identify those peptidyl fragments which can function as either agonists or antagonists of an E6-binding protein activity, such as by microinjection assays. In an illustrative embodiment, peptidyl portions of E6-binding proteins can be tested for E6-binding activity, as well as inhibitory ability, by expression as, for example, thioredoxin fusion proteins, each of which contains a discrete fragment of the E6-binding protein (see, for example, U.S. Pat. Nos. 5,270,181 and 5,292,646; and PCT publication WO94/02502).

It is also possible to modify the structure of the subject E6-binding proteins for such purposes as enhancing therapeutic or prophylactic efficacy, or stability (e.g., ex vivo shelf life and resistance to proteolytic degradation in vivo). Such modified peptides, when designed to retain at least one activity of the naturally-occurring form of the protein, are considered functional equivalents of the E6-binding protein described in more detail herein. Such modified peptide can be produced, for instance, by amino acid substitution, deletion, or addition.

Moreover, it is reasonable to expect that an isolated replacement of a leucine with an isoleucine or valine, an aspartate with a glutamate, a threonine with a serine, or a similar replacement of an amino acid with a structurally related amino acid (i.e. conservative mutations) will not have a major effect on the biological activity of the resulting molecule. Conservative replacements are those that take place within a family of amino acids that are related in their side chains. Genetically encoded amino acids are can be divided into four families: (1) acidic=aspartate, glutamate; (2) basic=lysine, arginine, histidine; (3) nonpolar=alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan; and (4) uncharged polar=glycine, asparagine, glutamine, cysteine, serine, threonine, tyrosine. Phenylalanine, tryptophan, and tyrosine are sometimes classified jointly as aromatic amino acids. In similar fashion, the amino acid repertoire can be grouped as (1) acidic=aspartate, glutamate; (2) basic=lysine, arginine histidine, (3) aliphatic=glycine, alanine, valine, leucine, isoleucine, serine, threonine, with serine and threonine optionally be grouped separately as aliphatic-hydroxyl; (4) aromatic=phenylalanine, tyrosine, tryptophan; (5) amide=asparagine, glutamine; and (6) sulfur-containing=cysteine and methionine. (see, for example, Biochemistry, 2nd ed., Ed. by L. Stryer, W. H. Freeman and Co.: 1981). Whether a change in the amino acid sequence of a peptide results in a functional E6-BP homolog can be readily determined by assessing the ability of the variant peptide to produce a response in cells in a fashion similar to the wild-type E6-BP. Peptides in which more than one replacement has taken place can readily be tested in the same manner.

This invention further contemplates a method of generating sets of combinatorial mutants of the presently disclosed novel E6-binding proteins, as well as truncation mutants, and is especially useful for identifying potential variant sequences which are functional in binding to a PV E6 protein, especially an E6 protein of a high risk HPV. One purpose for screening such combinatorial libraries is, for example, to isolate novel E6-BP homologs which function as one of either an agonist or antagonist of the biological activities of the wild-type ("authentic") protein, or alternatively, possess novel activities all together. To illustrate, E6-BP homologs can be engineered by the present method to provide proteins which bind E6 yet which act antagonistically to the role of the native E6-BP in papillomavirus infection, transformation and/or immortalization. Such proteins, when expressed from recombinant DNA constructs, can be used in gene therapy protocols.

Likewise, mutagenesis can give rise to E6-BP homologs which have intracellular half-lives dramatically different than the corresponding wild-type protein. For example, the altered protein can be rendered either more stable or less stable to proteolytic degradation or other cellular process which result in destruction of, or otherwise inactivation of, an E6-binding protein. Such E6-BP homologs and the genes which encode them, can be utilized to alter the envelope of expression for the particular recombinant E6 binding proteins by modulating the half-life of the recombinant protein. For instance, a short half-life can give rise to more transient biological effects associated with a particular recombinant E6-BP and, when part of an inducible expression system, can allow tighter control of recombinant E6-BP levels within the cell. As above, such proteins, and particularly their recombinant nucleic acid constructs, can be used in gene therapy protocols.

In an illustrative embodiment of this method, the amino acid sequences for a population of E6-BP homologs or other related proteins are aligned, preferably to promote the highest homology possible. Such a population of variants can include, for example, E6-BP homologs from one or more species, or E6-BP homologs from the same species but which differ due to mutation. Amino acids which appear at each position of the aligned sequences are selected to create a degenerate set of combinatorial sequences.

In a preferred embodiment, the combinatorial E6-BP library is produced by way of a degenerate library of genes encoding a library of polypeptides which each include at least a portion of potential E6-BP sequences. A mixture of synthetic oligonucleotides can be enzymatically ligated into gene sequences such that the degenerate set of potential E6-BP sequences are expressible as individual polypeptides, or alternatively, as a set of larger fusion proteins (e.g. for phage display) containing the set of E6-BP sequences therein.

There are many ways by which the library of potential E6-BP homologs can be generated from a degenerate oligonucleotide sequence. Chemical synthesis of a degenerate gene sequence can be carried out in an automatic DNA synthesizer, and the synthetic genes then be ligated into an appropriate gene for expression. The purpose of a degenerate set of genes is to provide, in one mixture, all of the sequences encoding the desired set of potential E6-BP sequences. The synthesis of degenerate oligonucleotides is well known in the art (see for example, Narang, S. A. (1983) Tetrahedron 39:3; Itakura et al. (1981) Recombinant DNA, Proc 3rd Cleveland Sympos. Macromolecules, ed. A. G. Walton, Amsterdam: Elsevier pp. 273-289; Itakura et al. (1984) Annu. Rev. Biochem. 53:323; Itakura et al. (1984) Science 198:1056; Ike et al. (1983) Nucleic Acid Res. 11:477. Such techniques have been employed in the directed evolution of other proteins (see, for example, Scott et al. (1990) Science 249:386-390; Roberts et al. (1992) PNAS 89:2429-2433; Devlin et al. (1990) Science 249: 404-406; Cwirla et al. (1990) PNAS 87:6378-6382; as well as U.S. Pat. Nos.: 5,223,409, 5,198,346, and 5,096,815).

Alternatively, other forms of mutagenesis can be utilized to generate a combinatorial library. For example, E6-BP homologs (both agonist and antagonist forms) can be generated and isolated from a library by screening using, for example, alanine scanning mutagenesis and the like (Ruf et al. (1994) Biochemistry 33:1565-1572; Wang et al. (1994) J. Biol. Chem. 269:3095-3099; Balint et al. (1993) Gene 137:109-118; Grodberg et al. (1993) Eur. J. Biochem. 218:597-601; Nagashima et al. (1993) J. Biol. Chem. 268:2888-2892; Lowman et al. (1991) Biochemistry 30:10832-10838; and Cunningham et al. (1989) Science 244:1081-1085), by linker scanning mutagenesis (Gustin et al. (1993) Virology 193:653-660; Brown et al. (1992) Mol. Cell Biol. 12:2644-2652; McKnight et al. (1982) Science 232:316); by saturation mutagenesis (Meyers et al. (1986) Science 232:613); by PCR mutagenesis (Leung et al. (1989) Method Cell Mol Biol 1:11-19); or by random mutagenesis (Miller et al. (1992) A Short Course in Bacterial Genetics, CSHL Press, Cold Spring Harbor, N.Y.; and Greener et al. (1994) Strategies in Mol Biol 7:32-34). Linker scanning matagenesis, particularly in a combinatorial setting, is on attractive method for identifying truncated (bioactive) forms of the E6-binding proteins.

A wide range of techniques are known in the art for screening gene products of combinatorial libraries made by point mutations, and for screening cDNA libraries for gene products having a certain property. Such techniques will be generally adaptable for rapid screening of the gene libraries generated by the combinatorial mutagenesis of E6-BP homologs. The most widely used techniques for screening large gene libraries typically comprises cloning the gene library into replicable expression vectors, transforming appropriate cells with the resulting library of vectors, and expressing the combinatorial genes under conditions in which detection of a desired activity facilitates relatively easy isolation of the vector encoding the gene whose product was detected. Each of the illustrative assays described below are amenable to high through-put analysis as necessary to screen large numbers of degenerate E6-BP sequences created by combinatorial mutagenesis techniques.

In one screening assay, the candidate E6-BP gene products are displayed on the surface of a cell or viral particle, and the ability of particular cells or viral particles to bind an E6 protein, such HPV-16 E6, via this gene product is detected in a "panning assay". For instance, the gene library can be cloned into the gene for a surface membrane protein of a bacterial cell, and the resulting fusion protein detected by panning (Ladner et al., WO 88/06630; Fuchs et al. (1991) Bio/Technology 9:1370-1371; and Goward et al. (1992) TIBS 18:136-140). In a similar fashion, fluorescently labeled E6 can be used to score for potentially functional E6-BP homologs. Cells can be visually inspected and separated under a fluorescence microscope, or, where the morphology of the cell permits, separated by a fluorescence-activated cell sorter.

In an alternate embodiment, the gene library is expressed as a fusion protein on the surface of a viral particle. For instance, in the filamentous phage system, foreign peptide sequences can be expressed on the surface of infectious phage, thereby conferring two significant benefits. First, since these phage can be applied to affinity matrices at very high concentrations, a large number of phage can be screened at one time. Second, since each infectious phage displays the combinatorial gene product on its surface, if a particular phage is recovered from an affinity matrix in low yield, the phage can be amplified by another round of infection. The group of almost identical E. coli filamentous phages M13, fd, and f1 are most often used in phage display libraries, as either of the phage gIII or gVIII coat proteins can be used to generate fusion proteins without disrupting the ultimate packaging of the viral particle (Ladner et al. PCT publication WO 90/02909; Garrard et al., PCT publication WO 92/09690; Marks et al. (1992) J. Biol. Chem. 267:16007-16010; Griffiths et al. (1993) EMBO J 12:725-734; Clackson et al. (1991) Nature 352:624-628; and Barbas et al. (1992) PNAS 89:4457-4461).

In an illustrative embodiment, the recombinant phage antibody system (RPAS, Pharmacia Catalog number 27-9400-01) can be easily modified for use in expressing and screening E6-BP combinatorial libraries. For instance, the pCANTAB 5 phagemid of the RPAS kit contains the gene which encodes the phage gIII coat protein. The E6-BP combinatorial gene library can be cloned into the phagemid adjacent to the gIII signal sequence such that it will be expressed as a gIII fusion protein. After ligation, the phagemid is used to transform competent E. coli TG1 cells. Transformed cells are subsequently infected with M13KO7 helper phage to rescue the phagemid and its candidate E6-BP gene insert. The resulting recombinant phage contain phagemid DNA encoding a specific candidate E6-BP, and display one or more copies of the corresponding fusion coat protein. Those phage-displayed candidate E6-BPs which are capable of binding a E6 are selected or enriched by panning with E6. For instance, the phage library can be panned on glutathione immobilized E6-GST fusion proteins, and unbound phage washed away from the cells. The bound phage is then isolated, and if the recombinant phage express at least one copy of the wild type gIII coat protein, they will retain their ability to infect E. coli. Thus, successive rounds of reinfection of E. coli, and panning will greatly enrich for E6-BP homologs, which can retain an ability to bind E6 which can subsequently be screened for further biological activities in order to differentiate agonists and antagonists.

The invention also provides for reduction of the E6-binding motifs of the subject E6-binding proteins to generate mimetics, e.g. peptide or non-peptide agents, which are able to disrupt binding of an E6-BP of the present invention with a papillomavirus E6 protein. Thus, such mutagenic techniques are particularly useful to map the determinants of the E6-BP which participate in protein-protein interactions involved in, for example, binding of the subject E6-binding protein to a PV E6 protein. To illustrate, the critical residues of a subject E6-binding protein which are involved in molecular recognition of E6 can be determined and used to generate E6-BP-derived peptidomimetics which competitively inhibit binding of the E6-BP with E6 (see, for example, "Peptide inhibitors of human papillomavirus protein binding to retinoblastoma gene protein" European patent applications EP-412,762A and EP-B31,080A). By employing, for example, scanning mutagenesis to map the amino acid residues of a particular E6-binding protein involved in binding E6, peptodomimetic compounds (e.g. diazepine or isoquinoline derivatives) can be generated which mimic those residues in binding to E6, and which therefore can inhibit binding of the E6-BP to E6 and thereby interfere with the function of E6 in PV infection. For instance, non-hydrolyzable peptide analogs of such residues can be generated using benzodiazepine (e.g., see Freidinger et al. in Peptides: Chemistry and Biology, G. R. Marshall ed., ESCOM Publisher: Leiden, Netherlands, 1988), azepine (e.g., see Huffinan et al. in Peptides: Chemistry and Biology, G. R. Marshall ed., ESCOM Publisher: Leiden, Netherlands, 1988), substituted gama lactam rings (Garvey et al. in Peptides: Chemistry and Biology, G. R. Marshall ed., ESCOM Publisher: Leiden, Netherlands, 1988), keto-methylene pseudopeptides (Ewenson et al. (1986) J Med Chem 29:295; and Ewenson et al. in Peptides: Structure and Function (Proceedings of the 9th American Peptide Symposium) Pierce Chemical Co. Rockland, Ill., 1985), β-turn dipeptide cores (Nagai et al. (1985) Tetrahedron Lett 26:647; and Sato et al. (1986) J Chem Soc Perkin Trans 1:1231), and β-aminoalcohols (Gordon et al. (1985) Biochem Biophys Res Commun 126:419; and Dann et al. (1986) Biochem Biophys Res Commun 134:71).

Another aspect of the invention pertains to an antibody specifically reactive with one of the subject E6-binding proteins. For example, by using immunogens derived from the present activity E6-binding proteins, based on the cDNA sequences, anti-protein/anti-peptide antisera or monoclonal antibodies can be made by standard protocols (See, for example, Antibodies: A Laboratory Manual ed. by Harlow and Lane (Cold Spring Harbor Press: 1988)). A mammal such as a mouse, a hamster or rabbit can be immunized with an immunogenic form of the peptide (e.g., E6-binding protein or an antigenic fragment which is capable of eliciting an antibody response). Techniques for conferring immunogenicity on a protein or peptide include conjugation to carriers or other techniques well known in the art. An immunogenic portion of the subject E6-binding proteins can be administered in the presence of adjuvant. The progress of immunization can be monitored by detection of antibody titers in plasma or serum. Standard ELISA or other immunoassays can be used with the immunogen as antigen to assess the levels of antibodies. In a preferred embodiment, the subject antibodies are immunospecific for antigenic determinants of the E6-binding proteins of the present invention, e.g. antigenic determinants of a protein represented by one of SEQ ID Nos: 8-14 or a closely related human or non-human mammalian homolog (e.g. 90 percent homologous, more preferably at least 95 percent homologous). In yet a further preferred embodiment of the present invention, the anti-E6-BP antibodies do not substantially cross react (i.e. react specifically) with a protein which is: e.g. less than 90 percent homologous to one of SEQ ID Nos: 8-14; e.g. less than 95 percent homologous with one of SEQ ID Nos: 8-14; e.g. less than 98-99 percent homologous with one of SEQ ID Nos: 8-14. By "not substantially cross react", it is meant that the antibody has a binding affinity for a non-homologous protein (e.g. E6) which is less than 10 percent, more preferably less than 5 percent, and even more preferably less than 1 percent, of the binding affinity for a protein of SEQ ID Nos: 8-14.

Following immunization, anti-E6-BP antisera can be obtained and, if desired, polyclonal anti-E6-BP antibodies isolated from the serum. To produce monoclonal antibodies, antibody producing cells (lymphocytes) can be harvested from an immunized animal and fused by standard somatic cell fusion procedures with immortalizing cells such as myeloma cells to yield hybridoma cells. Such techniques are well known in the art, an include, for example, the hybridoma technique (originally developed by Kohler and Milstein, (1975) Nature, 256:495-497), the human B cell hybridoma technique (Kozbar et al., (1983) Immunology Today, 4:72), and the EBV-hybridoma technique to produce human monoclonal antibodies (Cole et al., (1985) Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc. pp. 77-96). Hybridoma cells can be screened immunochemically for production of antibodies specifically reactive with an E6-binding protein of the present invention and monoclonal antibodies isolated from a culture comprising such hybridoma cells.

The term antibody as used herein is intended to include fragments thereof which are also specifically reactive with one of the subject E6-binding protein. Antibodies can be fragmented using conventional techniques and the fragments screened for utility in the same manner as described above for whole antibodies. For example, F(ab')₂ fragments can be generated by treating antibody with pepsin. The resulting F(ab')₂ fragment can be treated to reduce disulfide bridges to produce Fab' fragments. The antibody of the present invention is further intended to include bispecific and chimeric molecules having an anti-E6-BP portion.

Both monoclonal and polyclonal antibodies (Ab) directed against E6-BP or E6-BP variants, and antibody fragments such as Fab' and F(ab')₂, can be used to block the action of E6-BP and allow the study of the role of a particular E6 binding protein of the present invention in papillomavirus infection, transformation and/or immortalization, as well as the normal cellular function of the E6-binding protein, e.g. by microinjection of anti-E6BP antibodies of the present invention.

Antibodies which specifically bind E6-BP epitopes can also be used in immunohistochemical staining of tissue samples in order to evaluate the abundance and pattern of expression of each of the subject E6-BP. Anti-E6-BP antibodies can be used diagnostically in immuno-precipitation and immuno-blotting to detect and evaluate E6-BP levels in tissue or bodily fluid as part of a clinical testing procedure. For instance, such measurements can be useful in predictive valuations of the onset or progression of HPV infection. Likewise, the ability to monitor E6-BP levels in an individual can allow determination of the efficacy of a given treatment regimen for an individual afflicted with such a disorder. The level of E6-BP can be measured in cells found in bodily fluid, such as in samples of cerebral spinal fluid, or can be measured in tissue, such as produced by biopsy. Diagnostic assays using anti-E6-BP antibodies can include, for example, immunoassays designed to aid in early diagnosis of a neoplastic or hyperplastic disorder, e.g. the presence of cancerous cells in the sample, e.g. PV-infected cells, e.g. PV-transformed cells, e.g. PV-immortalized cells, e.g. to detect cells in which a lesion of the E6-BP gene has occurred.

Another application of anti-E6-BP antibodies is in the immunological screening of cDNA libraries constructed in expression vectors such as λgt11, λgt18-23, λZAP, and λORF8. Messenger libraries of this type, having coding sequences inserted in the correct reading frame and orientation, can produce fusion proteins. For instance, λgt11 will produce fusion proteins whose amino termini consist of β-galactosidase amino acid sequences and whose carboxy termini consist of a foreign polypeptide. Antigenic epitopes of a subject E6-BP can then be detected with antibodies, as, for example, reacting nitrocellulose filters lifted from infected plates with anti-E6-BP antibodies. Phage, scored by this assay, can then be isolated from the infected plate. Thus, the presence of E6-BP homologs can be detected and cloned from other sources, and alternate isoforms (including splicing variants) can be detected and cloned from human sources.

Moreover, the nucleotide sequence determined from the cloning of the subject E6-binding proteins from a human cell line will further allow for the generation of probes designed for use in identifying E6-BP homologs in other human cell types, as well as E6-BP homologs from other animals. For instance, the present invention also provides a probe/primer comprising a substantially purified oligonucleotide, wherein the oligonucleotide comprises a region of nucleotide sequence which hybridizes under stringent conditions to at least 10 consecutive nucleotides of sense or anti-sense sequence of one of SEQ ID Nos: 1-7, or naturally occurring mutants thereof. In preferred embodiments, the probe/primer further comprises a label group attached thereto and able to be detected, e.g. the label group is selected from the group consisting of radioisotopes, fluorescent compounds, enzymes, and enzyme co-factors. Such probes can be used as a part of a diagnostic test kit for identifying transformed cells, such as for measuring a level of an E6-BP nucleic acid in a sample of cells from a patient; e.g. measuring an E6-BP mRNA level; e.g. determining whether a genomic E6-BP gene has been mutated or deleted.

In addition, nucleotide probes can be generated from the cloned sequence of the subject E6-binding proteins, which allow for histological screening of intact tissue and tissue samples for the presence of an E6-BP mRNA. Similar to the diagnostic uses of anti-E6-BP antibodies, the use of probes directed to E6-BP mRNAs, or to genomic E6-BP sequences, can be used for both predictive and therapeutic evaluation of allelic mutations which might be manifest in, for example, neoplastic or hyperplastic disorders (e.g. unwanted cell growth). Used in conjunction with anti-E6-BP antibody immunoassays, the nucleotide probes can help facilitate the determination of the molecular basis for a developmental disorder which may involve some abnormality associated with expression (or lack thereof) of an E6-binding protein. For instance, variation in E6-BP synthesis can be differentiated from a mutation in the E6-BP coding sequence. Likewise, targeted destruction of the E6-BP by the papillomavirus E6 protein, as is believed to occur with p53, can be distinguished from E6 sequestering of the E6 binding protein (i.e. which can result in modification of the E6-BP cellular function).

For example, the present method provides a method for determining if a subject is at risk for a disorder characterized by unwanted cell proliferation. In preferred embodiments, the subject method can be generally characterized as comprising detecting, in a tissue of the subject (e.g. a human patient), the presence or absence of a genetic lesion characterized by at least one of (i) a mutation of a gene encoding one of the subject E6-BPs or (ii) the mis-expression of an E6-BP gene. To illustrate, such genetic lesions can be detected by ascertaining the existence of at least one of (i) a deletion of one or more nucleotides from a subject E6-BP gene, (ii) an addition of one or more nucleotides to such an E6-BP gene, (iii) a substitution of one or more nucleotides of an E6-BP gene, (iv) a gross chromosomal rearrangement of one of the subject E6-BP genes, (v) a gross alteration in the level of a messenger RNA transcript of an E6-BP gene, (vi) the presence of a non-wild type splicing pattern of a messenger RNA transcript of an E6-BP gene, and (vii) a non-wild type level of an E6-binding protein. In one aspect of the invention there is provided a probe/primer comprising an oligonucleotide containing a region of nucleotide sequence which is capable of hybridizing to a sense or antisense sequence of one of SEQ ID Nos: 1-7, or naturally occurring mutants thereof, or 5' or 3' flanking sequences or intronic sequences naturally associated with the subject E6-BP genes. The probe is exposed to nucleic acid of a tissue sample; and the hybridization of the probe to the sample nucleic acid is detected. In certain embodiments, detection of the lesion comprises utilizing the probe/primer in a polymerase chain reaction (PCR) (see, e.g., U.S. Pat. Nos.: 4,683,195 and 4,683,202) or, alternatively, in a ligation chain reaction (LCR) (see, e.g., Landegran et al. (1988) Science, 241:1077-1080; and NaKazawa et al. (1944) PNAS 91:360-364) the later of which can be particularly useful for detecting point mutations in the E6-BP gene. Alternatively, the level of E6-binding protein can detected in an immunoassay.

Also, the use of anti-sense techniques (e.g. microinjection of antisense molecules, or transfection with plasmids whose transcripts are anti-sense with regard to an E6-BP mRNA or gene sequence) can be used to investigate role of each of the subject E6-BP in HPV-mediated events (infection, transformation and/or immortalization), as well as the normal cellular function of each of the novel E6-BPs, e.g. in cell proliferation by inhibiting endogenous production of a particular E6-binding protein. Such techniques can be utilized in cell culture, but can also be used in the creation of transgenic animals.

Another aspect of the invention features transgenic non-human animals which express a heterologous E6-BP gene of the present invention, or which have had one or more genomic E6-BP gene(s) disrupted in at least one of the tissue or cell-types of the animal. For instance, transgenic mice that are disrupted at an E6-BP gene locus can be generated.

In another aspect, the invention features an animal model for developmental diseases, which has an E6-BP allele which is mis-expressed. For example, a mouse can be bred which has an E6-BP allele deleted, or in which all or part of one or more exons are deleted. Such a mouse model can then be used to study disorders arising from mis-expression of the E6-BP gene.

Accordingly, the present invention concerns transgenic animals which are comprised of cells (of that animal) which contain a transgene of the present invention and which preferably (though optionally) express an exogenous E6-binding protein in one or more cells in the animal. The E6-BP transgene can encode the wild-type form of the protein, or can encode homologs thereof, including both agonists and antagonists, as well as antisense constructs. In preferred embodiments, the expression of the transgene is restricted to specific subsets of cells, tissues or developmental stages utilizing, for example, cis-acting sequences that control expression in the desired pattern. Tissue-specific regulatory sequences and conditional regulatory sequences can be used to control expression of the transgene in certain spatial patterns. Moreover, temporal patterns of expression can be provided by, for example, conditional recombination systems or prokaryotic transcriptional regulatory sequences.

Genetic techniques which allow for the expression of transgenes can be regulated via site-specific genetic manipulation in vivo are known to those skilled in the art. For instance, genetic systems are available which allow for the regulated expression of a recombinase that catalyzes the genetic recombination a target sequence. As used herein, the phrase "target sequence" refers to a nucleotide sequence that is genetically recombined by a recombinase. The target sequence is flanked by recombinase recognition sequences and is generally either excised or inverted in cells expressing recombinase activity. Recombinase catalyzed recombination events can be designed such that recombination of the target sequence results in either the activation or repression of expression of the subject E6-binding proteins. For example, excision of a target sequence which interferes with the expression of a recombinant E6-BP gene can be designed to activate expression of that gene. This interference with expression of the protein can result from a variety of mechanisms, such as spatial separation of the E6-BP gene from the promoter element or an internal stop codon. Moreover, the transgene can be made wherein the coding sequence of the gene is flanked recombinase recognition sequences and is initially transfected into cells in a 3' to 5' orientation with respect to the promoter element. In such an instance, inversion of the target sequence will reorient the subject gene by placing the 5' end of the coding sequence in an orientation with respect to the promoter element which allow for promoter driven transcriptional activation.

In an illustrative embodiment, either the cre/loxP recombinase system of bacteriophage P1 (Lakso et al. (1992) PNAS 89:6232-6236; Orban et al. (1992) PNAS 89:6861-6865) or the FLP recombinase system of Saccharomyces cerevisiae (O'Gorman et al. (1991) Science 251:1351-1355; PCT publication WO 92/15694) can be used to generate in vivo site-specific genetic recombination systems. Cre recombinase catalyzes the site-specific recombination of an intervening target sequence located between loxP sequences. loxP sequences are 34 base pair nucleotide repeat sequences to which the Cre recombinase binds and are required for Cre recombinase mediated genetic recombination. The orientation of loxP sequences determines whether the intervening target sequence is excised or inverted when Cre recombinase is present (Abremski et al. (1984) J. Biol. Chem. 259:1509-1514); catalyzing the excision of the target sequence when the loxP sequences are oriented as direct repeats and catalyzes inversion of the target sequence when loxP sequences are oriented as inverted repeats.

Accordingly, genetic recombination of the target sequence is dependent on expression of the Cre recombinase. Expression of the recombinase can be regulated by promoter elements which are subject to regulatory control, e.g., tissue-specific, developmental stage-specific, inducible or repressible by externally added agents. This regulated control will result in genetic recombination of the target sequence only in cells where recombinase expression is mediated by the promoter element. Thus, the activation expression of the E6-BP gene can be regulated via regulation of recombinase expression.

Use of the cre/loxP recombinase system to regulate expression of a recombinant E6-binding protein requires the construction of a transgenic animal containing transgenes encoding both the Cre recombinase and the subject protein. Animals containing both the Cre recombinase and a recombinant E6-BP gene can be provided through the construction of "double" transgenic animals. A convenient method for providing such animals is to mate two transgenic animals each containing a transgene, e.g., an E6-BP gene and recombinase gene.

One advantage derived from initially constructing transgenic animals containing an E6-BP transgene in a recombinase-mediated expressible format derives from the likelihood that the subject protein may be deleterious upon expression in the transgenic animal. In such an instance, a founder population, in which the subject transgene is silent in all tissues, can be propagated and maintained. Individuals of this founder population can be crossed with animals expressing the recombinase in, for example, one or more tissues. Thus, the creation of a founder population in which, for example, an antagonistic E6-BP transgene is silent will allow the study of progeny from that founder in which disruption of cell regulation in a particular tissue or at developmental stages would result in, for example, a lethal phenotype.

Similar conditional transgenes can be provided using prokaryotic promoter sequences which require prokaryotic proteins to be simultaneous expressed in order to facilitate expression of the transgene. Exemplary promoters and the corresponding trans-activating prokaryotic proteins are given in U.S. Pat. No. 4,833,080. Moreover, expression of the conditional transgenes can be induced by gene therapy-like methods wherein a gene encoding the trans-activating protein, e.g. a recombinase or a prokaryotic protein, is delivered to the tissue and caused to be expressed, such as in a cell-type specific manner. By this method, the transgene could remain silent into adulthood until "turned on" by the introduction of the trans-activator.

In an exemplary embodiment, the "transgenic non-human animals" of the invention are produced by introducing transgenes into the germline of the non-human animal. Embryonal target cells at various developmental stages can be used to introduce transgenes. Different methods are used depending on the stage of development of the embryonal target cell. The zygote is the best target for micro-injection. In the mouse, the male pronucleus reaches the size of approximately 20 micrometers in diameter which allows reproducible injection of 1-2 pl of DNA solution. The use of zygotes as a target for gene transfer has a major advantage in that in most cases the injected DNA will be incorporated into the host gene before the first cleavage (Brinster et al. (1985) PNAS 82:4438-4442). As a consequence, all cells of the transgenic non-human animal will carry the incorporated transgene. This will in general also be reflected in the efficient transmission of the transgene to offspring of the founder since 50% of the germ cells will harbor the transgene. Microinjection of zygotes is the preferred method for incorporating transgenes in practicing the invention.

Retroviral infection can also be used to introduce transgene into a non-human animal. The developing non-human embryo can be cultured in vitro to the blastocyst stage. During this time, the blastomeres can be targets for retroviral infection (Jaenich, R. (1976) PNAS 73:1260-1264). Efficient infection of the blastomeres is obtained by enzymatic treatment to remove the zona pellucida (Manipulating the Mouse Embryo, Hogan eds. (Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 1986). The viral vector system used to introduce the transgene is typically a replication-defective retrovirus carrying the transgene (Jahner et al. (1985) PNAS 82:6927-6931; Van der Putten et al. (1985) PNAS 82:6148-6152). Transfection is easily and efficiently obtained by culturing the blastomeres on a monolayer of virus-producing cells (Van der Putten, supra; Stewart et al. (1987) EMBO J. 6:383-388). Alternatively, infection can be performed at a later stage. Virus or virus-producing cells can be injected into the blastocoele (Jahner et al. (1982) Nature 298:623-628). Most of the founders will be mosaic for the transgene since incorporation occurs only in a subset of the cells which formed the transgenic non-human animal. Further, the founder may contain various retroviral insertions of the transgene at different positions in the genome which generally will segregate in the offspring. In addition, it is also possible to introduce transgenes into the germ line by intrauterine retroviral infection of the midgestation embryo (Jahner et al. (1982) supra).

A third type of target cell for transgene introduction is the embryonal stem cell (ES). ES cells are obtained from pre-implantation embryos cultured in vitro and fused with embryos (Evans et al. (1981) Nature 292:154-156; Bradley et al. (1984) Nature 309:255-258; Gossler et al. (1986) PNAS 83: 9065-9069; and Robertson et al. (1986) Nature 322:445-448). Transgenes can be efficiently introduced into the ES cells by DNA transfection or by retrovirus-mediated transduction. Such transformed ES cells can thereafter be combined with blastocysts from a non-human animal. The ES cells thereafter colonize the embryo and contribute to the germ line of the resulting chimeric animal. For review see Jaenisch, R. (1988) Science 240:1468-1474.

Methods of making knock-out or disruption transgenic animals are also generally known. See, for example, Manipulating the Mouse Embryo, (Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1986). Recombinase dependent knockouts can also be generated, e.g. by homologous recombination to insert target sequences, such that tissue specific and/or temporal control of inactivation of an E6-BP gene can be controlled as above.

Yet another aspect of the invention pertains to methods of treating proliferative and/or differentiative disorders which arise from cells in which an E6-BP is implicated in the pathogenesis of the disorder. There are a wide variety of pathological cell proliferative conditions for which the E6-BP gene constructs, E6-binding mimetics, and E6-binding antagonists of the present invention may provide therapeutic benefits, with the general strategy being the inhibition of anomalous cell proliferation. For instance, the gene constructs of the present invention can be used as a part of a gene therapy protocol, such as to reconstitute the function of an E6-binding protein, e.g. in a cell in which the protein is misexpressed or in which transduction pathways upstream of the E6-binding protein are dysfunctional, or to inhibit the function of the wild-type protein, e.g. by delivery of a dominant negative mutant.

To illustrate, cell types which exhibit pathological or abnormal growth presumably dependent at least in part on a function of an E6-binding protein of the present invention include various cancers as well as papillomavirus-infected cells.

It will also be apparent that, by transient use of gene therapy constructs of the subject E6-binding proteins (e.g. agonist and antagonist forms) or antisense nucleic acids, in vivo control of an E6-BP function can be accomplished. In one aspect of the invention, expression constructs of the subject E6-binding proteins may be administered in any biologically effective carrier, e.g. any formulation or composition capable of effectively transfecting cells in vivo with a recombinant E6-BP gene. Approaches include insertion of the subject gene in viral vectors including recombinant retroviruses, adenovirus, adeno-associated virus, and herpes simplex virus-1, or recombinant bacterial or eukaryotic plasmids. Viral vectors can be used to transfect cells directly; plasmid DNA can be delivered with the help of, for example, cationic liposomes (lipofectin) or derivatized (e.g. antibody conjugated), polylysine conjugates, gramacidin S, artificial viral envelopes or other such intracellular carriers, as well as direct injection of the gene construct or CaPO₄ precipitation carried out in vivo. It will be appreciated that because transduction of appropriate target cells represents the critical first step in gene therapy, choice of the particular gene delivery system will depend on such factors as the phenotype of the intended target and the route of administration, e.g. locally or systemically.

A preferred approach for in vivo introduction of nucleic acid encoding one of the subject proteins into a cell is by use of a viral vector containing nucleic acid, e.g. a cDNA, encoding the gene product. Infection of cells with a viral vector has the advantage that a large proportion of the targeted cells can receive the nucleic acid. Additionally, molecules encoded within the viral vector, e.g., by a cDNA contained in the viral vector, are expressed efficiently in cells which have taken up viral vector nucleic acid.

Retrovirus vectors and adeno-associated virus vectors are generally understood to be the recombinant gene delivery system of choice for the transfer of exogenous genes in vivo, particularly into humans. These vectors provide efficient delivery of genes into cells, and the transferred nucleic acids are stably integrated into the chromosomal DNA of the host. A major prerequisite for the use of retroviruses is to ensure the safety of their use, particularly with regard to the possibility of the spread of wild-type virus in the cell population. The development of specialized cell lines (termed "packaging cells") which produce only replication-defective retroviruses has increased the utility of retroviruses for gene therapy, and defective retroviruses are well characterized for use in gene transfer for gene therapy purposes (for a review see Miller, A. D. (1990) Blood 76:271). Thus, recombinant retrovirus can be constructed in which part of the retroviral coding sequence (gag, pol, env) has been replaced by nucleic acid encoding an E6-binding protein, rendering the retrovirus replication defective. The replication defective retrovirus is then packaged into virions which can be used to infect a target cell through the use of a helper virus by standard techniques. Protocols for producing recombinant retroviruses and for infecting cells in vitro or in vivo with such viruses can be found in Current Protocols in Molecular Biology, Ausubel, F. M. et al. (eds.) Greene Publishing Associates, (1989), Sections 9.10-9.14 and other standard laboratory manuals. Examples of suitable retroviruses include pLJ, pZIP, pWE and pEM which are well known to those skilled in the art. Examples of suitable packaging virus lines for preparing both ecotropic and amphotropic retroviral systems include ψCrip, ψCre, ψ2 and ψAm. Retroviruses have been used to introduce a variety of genes into many different cell types, including neural cells, epithelial cells, endothelial cells, lymphocytes, myoblasts, hepatocytes, bone marrow cells, in vitro and/or in vivo (see for example Eglitis, et al. (1985) Science 230:1395-1398; Danos and Mulligan (1988) Proc. Natl. Acad. Sci. USA 85:6460-6464; Wilson et al. (1988) Proc. Natl. Acad. Sci. USA 85:3014-30i8; Armentano et al. (1990) Proc. Natl. Acad. Sci. USA 87:6141-6145; Huber et al. (1991) Proc. Natl. Acad Sci. USA 88:8039-8043; Ferry et al. (1991) Proc. Natl. Acad. Sci. USA 88:8377-8381; Chowdhury et al. (1991) Science 254:1802-1805; van Beusechem et al. (1992) Proc. Natl. Acad. Sci. USA 89:7640-7644; Kay et al. (1992) Human Gene Therapy 3:641-647; Dai et al. (1992) Proc. Natl. Acad. Sci. USA 89:10892-10895; Hwu et al. (1993) J. Immunol. 150:4104-4115; U.S. Pat. Nos. 4,868,116; 4,980,286; PCT Application WO 89/07136; PCT Application WO 89/02468; PCT Application WO 89/05345; and PCT Application WO 92/07573).

In choosing retroviral vectors as a gene delivery system for the subject E6-BP genes, it is important to note that a prerequisite for the successful infection of target cells by most retroviruses, and therefore of stable introduction of the recombinant E6-BP gene, is that the target cells must be dividing. In general, this requirement will not be a hindrance to use of retroviral vectors to deliver antagonistic E6-BP gene constructs. In fact, such limitation on infection can be beneficial in circumstances wherein the tissue (e.g. nontransformed cells) surrounding the target cells does not undergo extensive cell division and is therefore refractory to infection with retroviral vectors. For example, papillomavirus-transformed cells can have mitotic indexes much higher than surrounding untransformed squamous cells.

Furthermore, it has been shown that it is possible to limit the infection spectrum of retroviruses and consequently of retroviral-based vectors, by modifying the viral packaging proteins on the surface of the viral particle (see, for example PCT publications WO93/25234, WO94/06920, and WO94/11524). For instance, strategies for the modification of the infection spectrum of retroviral vectors include: coupling antibodies specific for cell surface antigens to the viral env protein (Roux et al. (1989) PNAS 86:9079-9083; Julan et al. (1992) J. Gen Virol 73:3251-3255; and Goud et al. (1983) Virology 163:251-254); or coupling cell surface ligands to the viral env proteins (Neda et al. (1991) J. Biol. Chem. 266:14143-14146). Coupling can be in the form of the chemical cross-linking with a protein or other variety (e.g. lactose to convert the env protein to an asialoglycoprotein), as well as by generating fusion proteins (e.g. single-chain antibody/env fusion proteins). This technique, while useful to limit or otherwise direct the infection to certain tissue types, and can also be used to convert an ecotropic vector in to an amphotropic vector.

Moreover, use of retroviral gene delivery can be further enhanced by the use of tissue- or cell-specific transcriptional regulatory sequences which control expression of the E6-BP gene of the retroviral vector.

Another viral gene delivery system useful in the present invention utilizes adenovirus-derived vectors. The genome of an adenovirus can be manipulated such that it encodes a gene product of interest, but is inactivate in terms of its ability to replicate in a normal lytic viral life cycle (see, for example, Berkner et al. (1988) BioTechniques 6:616; Rosenfeld et al. (1991) Science 252:431-434; and Rosenfeld et al. (1992) Cell 68:143-155). Suitable adenoviral vectors derived from the adenovirus strain Ad type 5 dl324 or other strains of adenovirus (e.g., Ad2, Ad3, Ad7 etc.) are well known to those skilled in the art. Recombinant adenoviruses can be advantageous in certain circumstances in that they are not capable of infecting nondividing cells and can be used to infect a wide variety of cell types, including airway epithelium (Rosenfeld et al. (1992) cited supra), endothelial cells (Lemarchand et al. (1992) Proc. Natl. Acad. Sci. USA 89:6482-6486), hepatocytes (Herz and Gerard (1993) Proc. Natl. Acad. Sci. USA 90:2812-2816) and muscle cells (Quantin et al. (1992) Proc. Natl. Acad. Sci. USA 89:2581-2584). Furthermore, the virus particle is relatively stable and amenable to purification and concentration, and as above, can be modified so as to affect the spectrum of infectivity. Additionally, introduced adenoviral DNA (and foreign DNA contained therein) is not integrated into the genome of a host cell but remains episomal, thereby avoiding potential problems that can occur as a result of insertional mutagenesis in situations where introduced DNA becomes integrated into the host genome (e.g., retroviral DNA). Moreover, the carrying capacity of the adenoviral genome for foreign DNA is large (up to 8 kilobases) relative to other gene delivery vectors (Berkner et al., supra; Haj-Ahmand and Graham (1986) J. Virol. 57:267). Most replication-defective adenoviral vectors currently in use and therefore favored by the present invention are deleted for all or parts of the viral E1 and E3 genes but retain as much as 80% of the adenoviral genetic material (see, e.g., Jones et al. (1979) Cell 16:683; Berkner et al., supra; and Graham et al. in Methods in Molecular Biology, E. J. Murray, Ed. (Humana, Clifton, N.J., 1991) vol. 7. pp. 109-127). Expression of the inserted E6-BP gene can be under control of, for example, the E1A promoter, the major late promoter (MLP) and associated leader sequences, the E3 promoter, or exogenously added promoter sequences.

Yet another viral vector system useful for delivery of the subject E6-BP gene is the adeno-associated virus (AAV). Adeno-associated virus is a naturally occurring defective virus that requires another virus, such as an adenovirus or a herpes virus, as a helper virus for efficient replication and a productive life cycle. (For a review see Muzyczka et al. Curr. Topics in Micro. and Immunol. (1992) 158:97-129). It is also one of the few viruses that may integrate its DNA into non-dividing cells, and exhibits a high frequency of stable integration (see for example Flotte et al. (1992) Am. J. Respir. Cell. Mol. Biol. 7:349-356; Samulski et al. (1989) J. Virol. 63:3822-3828; and McLaughlin et al. (1989) J. Virol. 62:1963-1973). Vectors containing as little as 300 base pairs of AAV can be packaged and can integrate. Space for exogenous DNA is limited to about 4.5 kb. An AAV vector such as that described in Tratschin et al. (1985) Mol. Cell. Biol. 5:3251-3260 can be used to introduce DNA into cells. A variety of nucleic acids have been introduced into different cell types using AAV vectors (see for example Hermonat et al. (1984) Proc. Natl. Acad. Sci. USA 81:6466-6470; Tratschin et al. (1985) Mol. Cell. Biol. 4:2072-2081; Wondisford et al. (1988) Mol. Endocrinol. 2:32-39; Tratschin et al. (1984) J. Virol. 51:611-619; and Flotte et al. (1993) J. Biol Chem. 268:3781-3790).

Other viral vector systems that may have application in gene therapy have been derived from herpes virus, vaccinia virus, and several RNA viruses. In particular, herpes virus vectors may provide a unique strategy for persistence of the recombinant E6-BP gene in cells of the central nervous system and ocular tissue (Pepose et al. (1994) Invest Ophthalmol Vis Sci 35:2662-2666).

In addition to viral transfer methods, such as those illustrated above, non-viral methods can also be employed to cause expression of an E6-binding protein in the tissue of an animal. Most nonviral methods of gene transfer rely on normal mechanisms used by mammalian cells for the uptake and intracellular transport of macromolecules. In preferred embodiments, non-viral gene delivery systems of the present invention rely on endocytic pathways for the uptake of the subject E6-BP gene by the targeted cell. Exemplary gene delivery systems of this type include liposomal derived systems, poly-lysine conjugates, and artificial viral envelopes.

In a representative embodiment, a gene encoding an E6-binding protein can be entrapped in liposomes bearing positive charges on their surface (e.g., lipofectins) and (optionally) which are tagged with antibodies against cell surface antigens of the target tissue Mizuno et al. (1992) No Shinkei Geka 20:547-551; PCT publication WO91/06309; Japanese patent application 1047381; and European patent publication EP-A-43075). For example, lipofection of papillomavirus-infected cells can be carried out using liposomes tagged with monoclonal antibodies against papillomavirus-associated antigens.

A variety of assay formats will suffice and, in light of the present disclosure, those not expressly described herein will nevertheless be comprehended by one of ordinary skill in the art. Agents to be tested for their ability to act as E6-BP inhibitors can be produced, for example, by bacteria, yeast or other organisms (e.g. natural products), produced chemically (e.g. small molecules, including peptidomimetics), or produced recombinantly. In a preferred embodiment, the test agent is a small organic molecule, e.g., other than a peptide, oligonucleotide, or analog thereof, having a molecular weight of less than about 10,000 daltons, preferably less than 5,000 daltons, even more preferable, less than 2,000 daltons.

In many drug screening programs which test libraries of compounds and natural extracts, high throughput assays are desirable in order to maximize the number of compounds surveyed in a given period of time. Assays which are performed in cell-free systems, such as may be derived with purified or semi-purified proteins, are often preferred as "primary" screens in that they can be generated to permit rapid development and relatively easy detection of an alteration in a molecular target which is mediated by a test compound. Moreover, the effects of cellular toxicity and/or bioavailability of the test compound can be generally ignored in the in vitro system, the assay instead being focused primarily on the effect of the drug on the molecular target as may be manifest in an alteration of binding affinity between an E6-binding protein and E6, or in changes in a property of the molecular target for E6-binding binding. Accordingly, in an exemplary screening assay of the present invention, the compound of interest is contacted with an isolated and purified E6-binding protein which is ordinarily capable of binding E6. To the mixture of the compound and E6-binding protein is then added a composition containing an E6 polypeptide. Detection and quantification of E6/E6-BP complexes provides a means for determining the compounds efficacy at inhibiting (or potentiating) complex formation between the E6 and E6-binding proteins. The efficacy of the compound can be assessed by generating dose response curves from data obtained using various concentrations of the test compound. Moreover, a control assay can also be performed to provide a baseline for comparison. In the control assay, isolated and purified E6 is added to a composition containing the E6-binding protein, and the formation of E6/E6-binding complex is quantitated in the absence of the test compound. It will be understood that, in general, the order in which the reactants may be admixed can be varied, and can be admixed simultaneously. Moreover, E6 can be substituted with other proteins to which any of the E6-binding proteins bind.

Complex formation between the E6-binding protein and target polypeptide may be detected by a variety of techniques. For instance, modulation of the formation of complexes can be quantitated using, for example, detectably labeled proteins such as radiolabeled (e.g. ³² P, ³⁵ S, ¹⁴ C or ³ H), fluorescently labeled (e.g. FITC), or enzymatically labeled E6-binding proteins or E6 polypeptides, by immunoassay, or by chromatographic detection.

Typically, it will be desirable to immobilize either the E6-binding protein or the E6 polypeptide to facilitate separation of E6-BP/E6 complexes from uncomplexed forms of one or both of the proteins, as well as to accommodate automation of the assay. Binding of E6 to E6-binding proteins, in the presence and absence of a candidate agent, can be accomplished in any vessel suitable for containing the reactants. Examples include microtitre plates, test tubes, and micro-centrifuge tubes. In one embodiment, a fusion protein can be provided which adds a domain that allows the protein to be bound to a matrix. For example, glutathione-S-transferase/E6-BP (GST/E6-BP) fusion proteins can be adsorbed onto glutathione sepharose beads (Sigma Chemical, St. Louis, Mo.) or glutathione derivatized microtitre plates, which are then combined with the E6 polypeptide, e.g. an ³⁵ S-labeled E6 polypeptide, and the test compound, and the mixture incubated under conditions conducive to complex formation, e.g. at physiological conditions for salt and pH, though slightly more stringent conditions may be desired, e.g., at 4° C. in a buffer containing 0.6M NaCl or a detergent such as 0.1% Triton X-100. Following incubation, the beads are washed to remove any unbound E6 polypeptide, and the matrix immobilized radiolabel determined directly (e.g. beads placed in scintilant), or in the supernatant after the E6-BP/E6 complexes are subsequently dissociated. Alternatively, the complexes can dissociated from the matrix, separated by SDS-PAGE, and the level of E6 polypeptide found in the bead fraction quantitated from the gel using standard electrophoretic techniques such as described in the appended examples.

Other techniques for immobilizing proteins on matrices are also available for use in the subject assay. For instance, either of the E6-binding protein or E6 proteins can be immobilized utilizing conjugation of biotin and streptavidin. For instance, biotinylated E6-BP molecules can be prepared from biotin-NHS (N-hydroxy-succinimide) using techniques well known in the art (e.g., biotinylation kit, Pierce Chemicals, Rockford, Ill.), and immobilized in the wells of streptavidin-coated 96 well plates (Pierce Chemical). Alternatively, antibodies reactive with E6, but which do not interfere with E6-BP binding can be derivatized to the wells of the plate, and the E6 polypeptide trapped in the wells by antibody conjugation. As above, preparations of an E6-BP polypeptide and a test compound are incubated in the E6-presenting wells of the plate, and the amount of E6-BP/E6 complex trapped in the well can be quantitated. Exemplary methods for detecting such complexes, in addition to those described above for the GST-immobilized complexes, include immunodetection of complexes using antibodies reactive with one of either the E6 polypeptide or E6-BP; as well as enzyme-linked assays which rely on detecting an enzymatic activity associated with one of the polypeptides. In the instance of the latter, the enzyme can be chemically conjugated or provided as a fusion protein. To illustrate, the E6-BP polypeptide can be chemically cross-linked or genetically fused with horseradish peroxidase, and the amount of E6-BP polypeptide trapped in the complex can be assessed with a chromogenic substrate of the enzyme, e.g. 3,3'-diamino-benzadine terahydrochloride or 4-chloro-1-napthol. Likewise, a fusion protein comprising the E6 polypeptide and glutathione-S-transferase can be provided, and complex formation quantitated by detecting the GST activity using 1-chloro-2,4-dinitrobenzene (Habig et al (1974) J Biol Chem 249:7130).

For processes which rely on immunodetection for quantitating one of the proteins trapped in the complex, antibodies against the protein, such as either anti-E6 or anti-E6-BP antibodies, can be used. Alternatively, the protein to be detected in the complex can be "epitope tagged" in the form of a fusion protein which includes, in addition to the E6 polypeptide or E6-BP sequence, a second polypeptide for which antibodies are readily available (e.g. from commercial sources). For instance, the GST fusion proteins described above can also be used for quantification of binding using antibodies against the GST moiety. Other useful epitope tags include myc-epitopes (e.g., see Ellison et al. (1991) J Biol Chem 266:21150-21157) which includes a 10-residue sequence from c-myc, as well as the pFLAG system (International Biotechnologies, Inc.) or the pEZZ-protein A system (Pharamacia, N.J.).

One aspect of the present invention which facilitates the generation of drug screening assays, particularly the high-throughout assays described below, is the identification of the E6 binding motif of the E6-BP^(SD7) protein. For instance, the present invention provides portions of the SD-7 protein which may be easier to manipulate than the full length protein. As described in the appended examples, the present invention provides polypeptides which include a portions of the SD-7 protein which retain the ability to bind to the E6 protein. Such E6-binding motifs can include an amino acid sequence corresponding to Ala194-Asp218 of SEQ ID No. 8.

For instance, SD-7 polypeptide fragments useful in the subject screening assays may be represented by the general formula X-Y-Z, wherein Y represents an amino acid sequence of an E6-binding motif within residues 194 to 218 of SEQ ID No. 8, X is absent, or represents a portion of the amino acid sequence between residues 1 and 194 of SEQ ID No. 8 and (optionally) immediately N-terminal to Y, and Z is absent, or represents a portion of the amino acid sequence between residues 218 and 317 of SEQ ID No. 8 and (optionally) immediately C-terminal to Y. Preferably, the polypeptide includes only about 25 to 200 residues of SD-7 polypeptide sequence, though more preferably includes only about 25, 50, 75 or 100 amino acid residues. In illustrative embodiments, the polypeptide used to generate the subject assay includes: an SD-7 polypeptide sequence corresponding to Ala194 through about Asp218; an SD-7 polypeptide sequence corresponding to Met99 through about Leu317; an SD-7 polypeptide sequence corresponding to Val107 through about Asp218; an SD-7 polypeptide sequence corresponding to Ala194 through about Glu316.

Additionally, the subject E6-binding proteins can be used to generate an interaction trap assay, as described in the examples below (see also, U.S. Pat. No.: 5,283,317; Zervos et al. (1993) Cell 72:223-232; Madura et al. (1993) J Biol Chem 268:12046-12054; Bartel et al. (1993) Biotechniques 14:920-924; and Iwabuchi et al. (1993) Oncogene 8:1693-1696), for subsequently detecting agents which disrupt binding of the E6-BP to an E6 protein. The interaction trap assay relies on reconstituting in vivo a functional transcriptional activator protein from two separate fusion proteins, one of which comprises the DNA-binding domain of a transcriptional activator fused to an E6 protein. The second fusion protein comprises a transcriptional activation domain (e.g. able to initiate RNA polymerase transcription) fused to one of the subject E6-binding proteins. When the E6 and E6-binding protein interact, the two domains of the transcriptional activator protein are brought into sufficient proximity as to cause transcription of a reporter gene. In an illustrative embodiment, Saccharomyces cerevisiae YPB2 cells are transformed simultaneously with a plasmid encoding a GAL4db-E6 fusion and with a plasmid encoding the GAL4ad domain fused to a subject E6-BP. Moreover, the strain is transformed such that the GAL4-responsive promoter drives expression of a phenotypic marker. For example, the ability to grow in the absence of histidine can depends on the expression of the HIS3 gene. When the HIS3 gene is placed under the control of a GAL4-responsive promoter, relief of this auxotrophic phenotype indicates that a functional GAL4 activator has been reconstituted through the interaction of E6 and the E6-BP. Thus, agent able to inhibit E6-BP interaction with E6 will result in yeast cells unable to growth in the absence of histidine. Alternatively, the phenotypic marker (e.g. instead of the HIS3 gene) can be one which provides a negative selection when expressed such that agents which disrupt E6/E6-BP interactions confer positive growth selection to the cells.

Moreover, in instances wherein one of the subject E6-binding proteins possess an enzymatic activity, inhibitors of the enzymatic activity can be identified using assays derived from measuring the ability of an agent to inhibit catalytic conversion of a substrate by the subject enzyme.

In another aspect, the invention features transgenic non-human animals which express a recombinant E6-BP gene of the present invention, or which have had one or more of the subject E6-BP gene(s), e.g. heterozygous or homozygous, disrupted in at least one of the tissue or cell-types of the animal.

In another aspect, the invention features an animal model for developmental diseases, which has an E6-BP allele which is mis-expressed. For example, a mouse can be bred which has an E6-BP allele deleted, or in which all or part of one or more E6-BP exons are deleted. Such a mouse model can then be used to study disorders arising from mis-expressed E6-BP genes.

Exemplification

The invention now being generally described, it will be more readily understood by reference to the following examples which are included merely for purposes of illustration of certain aspects and embodiments of the present invention, and are not intended to limit the invention.

EXAMPLE 1 Cloning of Genes Encoding Proteins which Interact with the Papillomavirus E6 Protein

To identify genes encoding proteins that associate with HPV16 E6 protein, we employed a modified two-hybrid system that utilizes a genetic selection for genes encoding interacting proteins (see, for example, Fields et al. (1989) Nature 340:245-246; Chien et al. (1991) PNAS 88:9578-9582; Morrissey et al. (1989) J Virol 63:4422-5; and Lamberti et al. (1990) EMBO J 9:1907-1913). We have adapted this "two-hybrid system" by starting with a yeast strain expressing an HPV-16 E6/bovine papillomavirus (BPV) E2 DNA-binding domain (E2R) fusion protein ("E6-E2R") and a lacZ reporter driven by a promoter containing four E2 binding elements. The E6-E2R fusion protein can bind the E2 binding sites but does not induce expression of the reporter gene. This strain was then transformed with a library of plasmids in which randomly primed HeLa cell cDNA were inserted C-terminal to the strong VP16 transcription activation domain (Dalton et al. (1992) Cell 68:597-612). VP16/cDNA fusion proteins that can interact with E6-E2R, or that can bind the reporter promoter directly, would recruit the VP16 activation domain to the E2 binding sites and activate expression of the lacZ gene, and these yeast cells would subsequently stain blue on x-gal plates.

After screening approximately approx. 10⁶ independent yeast colonies, 40 were identified that became blue on galactose/x-gal. The VP16/cDNA plasmids were recovered after transformation into E. coli. These VP-16/cDNA genes were introduced into fresh yeast that contained the E2 reporter gene along with either the E6-E2 hybrid gene, a control vector, or the portion of the E2 gene used in the initial chimera. Results of this analysis indicated that some of the VP-16/cDNAs interact with the BPV E2 portion of the hybrid, others encoded proteins that activated the reporter gene even in the absence of E6-E2, while nine were found to reproducibly stimulate lacZ expression only in the presence of the HPV 16 E6-E2. Some of the 40 clones did not activate the reporter expression under any of these conditions, and in general, these were from the lightest blue colonies on the initial screen. We have also made a chimera with an N-terminus consisting of the 202 amino acid LexA DNA binding domain and in frame with HPV 16 E6. Using a yeast strain that has a chromosomal LexA dependent promoter which regulates the Leu2 gene, we have found that growth on leucine deficient media could be provided by all six VP-16 cDNAs, while several control VP-16 chimeras were non-viable. This provides additional evidence that the cDNAs interact with HPV 16 E6.

The cDNA plasmids that specifically interacted with E6 were subjected to DNA sequence analysis using a primer initiating within the VP16 coding sequence. This provides information on the reading frame at the fusion point with the cDNA. In general using this primer we have determined about 200-300 nucleotides of DNA sequence. In all cases an in frame open reading frame has been identified. We are also sequencing with the appropriate primers the 3' end of the cDNA insert. On three occasions the same gene was found twice. These differed at the point of the fusion with the VP16 activation domain and hence were not exact duplicates of the same library clone but instead represented independent isolates.

The SD-7 clone encoded an open reading frame of 210 amino acid residues with a HDEL signal sequence at the carboxy terminal. The 5' portion of SD7 was used as a probe to isolate the full-length cDNA from a HeLa lambda phage library. Several clones with overlapping inserts were isolated. These resulted in the recovery of cDNAs that together spanned 2 kb. Thus a complete copy of cDNA encoding the SD-7 protein was obtained. The nucleotide sequence and deduced amino acid sequence of the SD-7 cDNA clone is presented in the appended sequence listing. Besides the HDEL signal sequence in the carboxy terminal, there are 4 EF-hands in the C' half of E6-BP.

Northern blot analyses under high stringency conditions with SD7 sequence (or all cDNA fragments) hybridize to a transcript of 2,000 nucleotides in size that is present at similar levels in all the cell lines tested. Furthermore, SD-7 mRNA levels do not vary even in cell lines transformed by papillomavirus E6 genes.

To confirm further the in vivo association of E6 with the subject E6-binding proteins, in vitro binding assays were performed using in vitro translated E6 and immobilized GST-E6BP. For example, we observe that GST-SD7 and GST alone were used as positive and negative controls, respectively. GST-E6-BP binds HPV16 E6 to an extent comparable to that observed with E6-AP, while GST alone does not bind to HPV 16 E6.

With the DNA sequence of the 5' and 3' end of the cDNA, we searched the GenBank/EMBL data bases for homology with previously cloned and sequenced genes. This computer search has revealed that some of the cDNAs are derived from known genes, while at least eight (clones SD 7, 8, 12, 16, 22, 28 and 32) are novel genes. Two have potential metal binding motifs which we speculate could be involved in loading zinc onto the E6 peptide. One has not been cloned from human but is related to a member of the proteolytic machinery and we suspect could complex with E6/E6-AP. Data from study of multiple HPV 16 E6 mutations cloned as E2 chimeras indicate that there is specificity in the interaction: some mutants remain positive with some VP 16 cDNAs, while others are negative. This information further supports the validity of these clones.

Furthermore, a deposit of each of these clones as a library of pRS306 plasmids (designated "pRS306-E6BP") containing the 8 different novel clones isolated in the E6 interaction trap has been made with the American Type Culture Collection (Rockville, Md.) on Jul. 8, 1994, under the terms of the Budapest Treaty. ATCC Accession number 75827 has been assigned to the deposit. With this deposit in hand, one of ordinary skill in the art can generate the subject recombinant E6-BP genes and express recombinant forms of the subject E6-binding proteins. For instance, each of the E6-binding proteins of the present invention can be amplified from ATCC deposit no. 75827 by PCR using the following primers:

    ______________________________________                                         5'40 -TAC ATT AGG TCC TTT GTA GC-3'40  (SEQ ID No. 15)                         5'40 -GGC GTG AAT GTA AGC GTG AC-3'40  (SEQ ID No. 16)                         ______________________________________                                    

which prime amplification of the cDNA insert by hybridizing upstream of the VP-16 gene and downstream of the cDNA insert, respectively. The primer

    5'G CAG ATG TTT ACC GAT GCC C-3' (SEQ ID No. 17)

which primes within the VP16 gene and near the VP16/cDNA boundary, can also be used to isolate the clones of the ATCC deposit.

Moreover, it will be immediately evident to those skilled in the art that, in light of the guide to the 5' (and in some instances the 3' ends) to each of the clones provided in Table 1, each individual clone of the ATCC deposit can be isolated using primers based on the nucleotide sequences provided by SEQ ID Nos. 1-7, or a combination of such primers and the primers of SEQ ID Nos. 15, 16 and 17.

Isolated clones can be subcloned into expression vectors in order to produce a recombinant protein, or can be used to generate anti-sense constructs, or can be used to generate oligonucleotide probes. In an illustrative embodiment, oligonucleotide probes have been generated using the coding sequences for each of the clones of the subject ATCC deposit, and used in Southern hybridization and in situ hybridization assays to detect the pattern and abundance of expression of each of the E6-binding proteins.

Moreover, because each member of the ATCC deposit is a plasmid encoding a fusion protein identified from an interaction trap assay, the clone can be utilized directly from the deposit in a similar ITS employed as, for example, a drug screening assay, or alternatively, a mutagenesis assay for mapping E6 binding epitopes.

                  TABLE 1                                                          ______________________________________                                         Guide to pRS306-EgBP                                                           Clone  Nucleotide Sequence                                                                           Peptide Sequence                                                                            Name                                        ______________________________________                                         SD-7   SEQ ID No. 1   SEQ ID No. 8 E6-BP.sup.SD-7                              SD-8   SEQ ID No. 2   SEQ ID No. 9 E6-BP.sup.SD-8                              SD-12  SEQ ID No. 3   SEQ ID No. 10                                                                               E6-BP.sup.SD-12                             SD-16  SEQ ID No. 4   SEQ ID No. 11                                                                               E6-BP.sup.SD-16                             SD-22  SEQ ID No. 5   SEQ ID No. 12                                                                               E6-BP.sup.SD-22                             SD-28  SEQ ID No. 6   SEQ ID No. 13                                                                               E6-BP.sup.SD-28                             SD-32  SEQ ID No. 7   SEQ ID No. 14                                                                               E6-BP.sup.SD-32                             ______________________________________                                    

Bacterial and Yeast Strains

E. coli DH5 a (supE44, ΔlacU169 (80lacZdeltaM15), hsdR17, recA1, endA1, gyrA96, thi1, relA 1) were the transformation recipient for all plasmid constructions unless otherwise indicated. Yeast strain DBY1 was derived from BGW1-7a (MATα leu2-3 leu2-112 his4-519 ade1-100 ura3-52) by inactivating of the TRP1 gene. DBY1 was used as a host for the yeast two-hybrid system.

Plasmids

The URA selection gene in pBY-4 was inactivated by digestion with Stu I and replaced with the LEU2 gene from plasmid CV-13 (Morrissey et al. (1989) J Virol 63:4422-4425) to make pL-72. pE6E2T was constructed by inserting Bam HI-Sal I fragment containing the HPV-16E6 gene fused at its C-terminus to the BPV E2 DNA binding domain from pKPHPV16E6E2 into the Bam HI and Sal I sites of pYEplac112G pYEplac112GE2-R was made from pYEplac112GE2 by deletion of Nco I fragments that contains BPV-1 E2 activation domain.

pGEX plasmids (Pharmacia were used for expression of GST fusion proteins. For Example, pGSTSD7 was constructed by ligating polymerase chain reaction (PCR) products of the SD7 clone into pGEX-3T. pGST6E6 and pGST16E6 were constructed by ligation of the HPV6 E6 and HPV16 E6 open reading frames into pGEX-2T. Plasmid encoding GST-E6-AP have been previously described (Huibregtse et al (1993) Mol Cell Biol 13:4918-4927, 1993). pSP65 plasmid were used for cloning genes for in vitro transcription/translation. pSPBPVE6, pSP8E6 and pSP3 1E6 were constructed by ligation of the BPV-1 E6; HPV8 E6 and HPV3 1 E6 open reading frames into proper sites of pSP65. pSP6E6 and pSP 1 6E6 have been described (Crook et al. (1991) Cell 67:547-556). pSP11E6 and pSP18E6 have also been described (Werness et al. (1990) Science 248:76-79). pSP7 contains E6-BP fragment from pSD7 in pSP65.

Library Screening

All yeast transformations were done by the lithium acetate method (Schiestl et al. (1989), Curr Genet 16:339-346). The yeast strain DBY1 was transformed with pL-72 and pE6E2T to generate strain DLE6E2, which was maintained under selection for the LEU2⁺ and TRP1⁺ markers. DLE6E2 cells were then transformed with a library of yeast shuttle vector plasmids in which randomly primed HeLa cell cDNA was inserted C-terminal to the VP16 transcription activation domain. Transformants were plated on trp⁻, ura⁻, and leu⁻ selective minimal medium (YMM) containing 2% glucose as a carbon source. After 48-72 hr incubation at 30° C., colonies were transferred on filters to fresh plates containing selective medium and 2% galactose and incubated for an additional 18 hr to induce E6E2 and VP16/cDNA expression. The filters were then transferred to plates containing selective medium 2% galactose as well as X-gal. Color development time ranged from 8 to 24 hr, during which blue colonies were picked and processed as follows.

The VP16/cDNA plasmids were recovered from positive (blue) colonies after incubation in ura⁻ selective liquid YMM containing 2% glucose for a week or more and subsequently transformed into DH5α. These VP16/cDNA genes were introduced into fresh DLE6E2 or yeast that contained the pL-72 along with pE2-R (DLE2-R). Those clones that result in a blue colony only in DLE6E2 but not DLE2-R were considered E6 specific and were studied further.

cDNA Cloning and Sequencing

To obtain cDNAs containing the full-length coding sequence of E6-binding proteins, a random-primed (Clontech) as well as a poly-d(T) primed keratinocytes cDNA libraries in λgt11 were screened at high stringency with ³² P-labeled 5' fragments of the E6-BPs prepared using a random primers labeling kit with the non radioactive label, digoxigenin-dUTP (Boehringer Mannheim). Several positive lambda clones were isolated and inserts were subcloned into pBluescript II KS ± as a Sac I-Kpn I fragment or pUC 19 as PCR products. Double-stranded DNA was sequenced by the dideoxy chain termination method (Sanger et al. (1977) PNAS 74:5463-5467) using Sequenase reagents (U.S. Biochemical). Comparison of the sequence with the data bases was performed using GCG (Genetics Computer Group) FASTA program.

Protein Expression and Antibodies

Glutathione S-transferase (GST) fusion proteins were expressed in E. coli strain DH5α or JM109. One liter cultures were inoculated with 100 ml of stationary culture and grown for 1 hr before induced with 0.2 mM IPTG for 3 hr. Cells were harvested by centrifugation, resuspended in 50 ml of low salt association buffer (LSAB, 100 mM Tris-HCl, pH8.0, 100 mM NaCl, 1% NP-40, and 1 mM phenylmethylsulfonyl fluoride) plus 0.03% SDS, 2 mM DTT, and lysed by sonication. After centrifugation at 10,000 g for 10 min., supernatant were collected and mixed with glutathione Sepharose (Pharmacia). The mixture were rotated at 4° C. for 2 hr. The beads were then collected by centrifugation at 1000 g for 2 min., washed three times with 20 volume of LSAB, stored at 4° C.

In vitro-translated proteins were prepared by using the rabbit reticulocyte lysate translation system (Promega) and ³⁵ S-labeled cysteine or methionine (ICN).

A GST-SD7 fusion protein was purified and used to inject a rabbit. Serum was obtained from collected blood samples and used for immunoprecipitation.

In Vitro Binding

For in vitro binding, 30 μl of glutathione Sepharose containing approximately 2 μg of GST fusion proteins were combined with 2-20 μl of ³⁵ S-labeled in vitro translated proteins in LSAB in a total volume of 250 μl. The mixtures were rotated for 3 hr at 4° C. The mixtures were then washed six times with LSAB, boiled in SDS-gel loading buffer, and electrophoresed on SDS-polyacrylamide gels. Gels were fixed, soaked in autofluor (Dupont), dried, and exposed to Kodak XAR films. Gels were also scanned with a Molecular Dynamic Phosphor Imager.

EXAMPLE 2 Identification of E6-Binding Motif

Results

In order to determine the region of SD-7 that interacts with HPV16 E6, three types of deletions were introduced into the original isolate of SD-7 (i.e., the C-terminal 210 amino acid fragment of SD-7): N-terminal, C-terminal, and internal in-frame. These were constructed either with convenient restriction sites to drop out a DNA fragment, or employing PCR products to produce deletions or partial cDNAs. Mutant proteins were synthesized in E. coli as GST fusions. Equal amount of GST-SD-7 fusion proteins were assayed for their abilities to associate with in vitro-translated ³⁵ S-labeled HPV16 E6. FIG. 2 shows a schematic representation of the results. These studies revealed that the region between amino acid residues 194 and 218 is critical for interaction with HPV-16 E6. Any construct containing this region was capable of binding to E6, while mutants without this region could not bind. Thus, the region between amino acid residues 194 and 218 appears to contain the site for E6 binding. Amino acids outside this region may contribute to the interaction, as none of the deletion mutants binds as effectively as the original GST-SD-7. This E6 binding motif falls within the fourth EF-hand in SD-7, and contains all the putative loop sequences plus a few amino acids on both sides from the alpha-helical sequences.

To confirm these results, we engineered an additional GST fusion that contains only 25 amino acid E6-binding motif (amino acid residues 194 and 218) from the fourth EF-hand (GSTSD7M) and tested it in the in vitro binding assay. We also made a deletion of the fourth EF-hand (GSTSD7dlM, dl194 and 217) from the original isolate of E6BP. As shown in FIG. 3, GSTSD7M binds efficiently to HPV-16 E6, while GSTSD7dlM does not. As expected, SD-7 portions containing the complete fourth repeat of EF-hand of 36 amino acid residues (GSTSD7EF4) binds HPV-16 E6. As a control, a GST fusion protein containing the fifth repeat of EF-hand domain of 36 amino acid residues (GSTSD7EF5) does not bind.

Materials and methods

Plasmids. The plasmids encoding GST-E6B fusion protein (pGSTSD7), pVP16:E6BP, p16E6:E2R, and pL72 were described previously (Science 269:529-531). Modified pGEX2T (Pharmacia) contains BamH I, xho I, Cla I, Spe I, Xba I at the fusion point. Modified pGEX3X (Pharmacia) contains BamH I, Xho I, Cla I, Spe I, Xba I, Kpn I, Spe I, and EcoR I at the fusion point. Plasmid pSP16E6 and pProp53SP65 were obtained from Karen Vousden (Cell 67:547-556).

Construction of mutants. For GST-E6BP deletions, N, N1, N2, dlM1 and dlM2, restriction sites were used to delete partial coding sequences from pGSTSD7. Mutant dlM was created by PCR amplification of pGSTSD7 with the following primers: GST 5' primer CGATCGGGATCCGCTAGCATGTCCCCTATACTAGGT (SEQ ID No. 18) and GCGGGATCCTCTTGAATGACAAATTCCG (SEQ ID No. 19). The fragment was digested with Msc I and BamH I and then used to replace the Msc I-BamH I fragment in pGSTSD7. GST-E6BP mutant C was constructed by insertion of a BamH I-EcoR I fragment from pGSTSD7 into the BamH I and EcoR I sites of pGEX1 (pharmacia). A Hind III-XhoI fragment of pGSTSD7 was inserted into the modified pGEX2T to create GST-E6BP mutant C1.

Mutant EF4 was created by PCR amplification of pGTSD7 with primers GCGGGATCCTGACGGAATTTGTCATTCAAG (SEQ ID No. 20) and ATTCTCGAGCTAATTTGCAGTTGGGTCCCACC (SEQ ID No. 21); EF5 was created by PCR amplification of pGSTSD7 with primers GCGGGATCCTGATACTTGTTGAGAAAGACAG (SEQ ID No. 22) and ATTCTCGAGCTAAATGCCCTGATTATTAGG (SEQ ID No. 23). The fragments were digested with BamH I and Xho I, inserted into the modified pGEX3X.

Protein expression. Glutathione S-transferase (GST) fusion proteins were expressed in E. coli strain DH5α or JM 109. One liter cultures were inoculated with 100 ml of stationary culture and grown for 1 hour before induction with 0.2 mM IPTG for 3 hr. Cells were harvested by centrifugation, re-suspended in 50 ml of low salt association buffer (LSAB, 100 mM Tris-HCl, pH8.0, 100 mM NaCl, 1% NP-40, and 1 mM phenylmethylsulfonyl fluoride) plus 0.03% SDS, s mM DTT, and lysed by sonication. After centrifugation at 10,000 g for 10 minutes, supernatant was collected and mixed with glutathione Sepharose (Pharmacia). The mixture was rotated at 4° C. for two hours. The beads were then collected by centrifugation at 1000 g for 2 minutes, washed three times with 20 volume of LSAB, stored at4° C.

In vitro-translated E6 proteins were prepared by using the rabbit reticulocyte lysate translation system (Promega) and ³⁵ S-labeled cysteine or methionine (ICN).

In vitro binding. For in vitro binding, 30 μl of glutathione Sepharose containing approximately 2 μg of GST fusion proteins were combined with 2-20 μl of ³⁵ S-labeled in vitro translated proteins in LSAB in a total volume of 250 μl. The mixtures were rotated for 3 hours at 4° C. The mixtures were then washed six times with LSAB, boiled in SDS-gel loading buffer, and electrophoresed on SDS-polyacrylamide gels. Gels were fixed, dried, and scanned by Molecular Imager (Bio-Rad).

All of the above-cited references and publications are hereby incorporated by reference.

    __________________________________________________________________________     #             SEQUENCE LISTING                                                 - (1) GENERAL INFORMATION:                                                     -    (iii) NUMBER OF SEQUENCES: 23                                             - (2) INFORMATION FOR SEQ ID NO:1:                                             -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 954 base                                                           (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: both                                                         (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: cDNA                                                 -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:                                  - ATGCGGCTGG GCCCGAGGAC CGCGGCGTTG GGGCTGCTGC TGCTGTGCGC CG - #CCGCGGCC          60                                                                           - GGCGCCGGCA AGGCCGAGGA GCTGCACTAC CCGCTGGGCG AGCGCCGCAG CG - #ACTACGAC         120                                                                           - CGCGAGGCGC TGCTGGGCGT CCAGGAAGAT GTGGATGAAT ATGTTAAACT CG - #GCCACGAA         180                                                                           - GAGCAGCAAA AAAGACTGCA GGCGATCATA AAGAAAATCG ACTTGGACTC AG - #ATGGCTTT         240                                                                           - CTCACTGAAA GTGAACTCAG TTCATGGATT CAGATGTCTT TTAAGCATTA TG - #CTATGCAA         300                                                                           - GAAGCAAAAC AACAGTTTGT TGAATATGAT AAAAACAGTG ATGATACTGT GA - #CTTGGGAT         360                                                                           - GAATATAACA TTCAGATGTA TGATCGTGTG ATTGACTTTG ATGAGAACAC TG - #CTCTGGAT         420                                                                           - GATGCAGAAG AGGAGTCCTT TAGGAAGCTT CACTTAAAGG ACAAGAAGCG AT - #TTGAAAAA         480                                                                           - GCTAACCAGG ATTCAGGTCC CGGTTTGAGT CTTGAAGAAT TTATTGCTTT TG - #AGCATCCT         540                                                                           - GAAGAAGTTG ATTATATGAC GGAATTTGTC ATTCAAGAAG CTTTAGAAGA AC - #ATGACAAA         600                                                                           - AATGGTGATG GATTTGTTAG TTTGGAAGAA TTTCTTGGTG ATTACAGGTG GG - #ATCCAACT         660                                                                           - GCAAATGAAG ATCCAGAATG GATACTTGTT GAGAAAGACA GATTCGTGAA TG - #ATTATGAC         720                                                                           - AAAGATAACG ATGGCAGGCT TGATCCCCAA GAGCTGTTAC CTTGGGTAGT AC - #CTAATAAT         780                                                                           - CAGGGCATTG CACAAGAGGA GGCGCTTCAT CTAATTGATG AAATGGATTT GA - #ATGGTGAC         840                                                                           - AAAAAGCTCT CTGAAGAAGA GATTCTGGAA AACCCGGACT TGTTTCTCAC CA - #GTGAAGCC         900                                                                           - ACAGATTATG GCAGACAGCT CCATGATGAC TATTTCTATC ATGATGAGCT TT - #AA               954                                                                           - (2) INFORMATION FOR SEQ ID NO:2:                                             -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 688 base                                                           (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: cDNA                                                 -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:                                  - CCATCTCTAA CGAAGTGCCG GAGCCCCATG TGTATCCCCT GTCTCTAATC AT - #GTTTATGA          60                                                                           - GCGGCGGCTC ATCGACAAGT ACATTGCGGA GAATGGTACC GACCCCATCA AC - #AACCAGCC         120                                                                           - TCTCTCCGAG GAGCAGCTCA TCGACATCTG GCTGCCTCCA TTTCCTTCTG GA - #CCACCAAT         180                                                                           - AATGGCCCCA CCACCTCCCA TATGTCCAGA TTCTGTTGAT GATGCTGATG CT - #TTGGGAAG         240                                                                           - TATGTTAATT TATGGTACAG AGTGGCTATA TACGGCTATA TATGGTTGTC GG - #AGGAGCAG         300                                                                           - CTCATCGGAC ATCAAAGTTG CTCACCCAAT CCGGCCCAAG CCTCCCTCAG GC - #CACCAGCA         360                                                                           - TCCCGGCCAT TCTGAAAGCT TTGCAGGATG AGTGGGATGC AGTCATGCTG CA - #CAGCTTCA         420                                                                           - CTCTGCGCCA GCAGCTGCAG ACAACCCGCC AAGAGCTGTC ACACGCTCTG TA - #CCAGCACG         480                                                                           - ATGCCGCCTG CCGTGTCATT GCCCGTCTCA CCAAGGAAGT CACTGCTGCC CG - #AGAAGCTC         540                                                                           - TGGCTACCCT GAAACCACAG GCTGGCCTCA TTGTGCCCCA GCTGTGCCAA GT - #TCCCAACC         600                                                                           - AAGTGTTGTG GGTGCGGGTG AGCCAATGGA TTTGGGTGAG CTGGTGGGAA TG - #ACCCCAGA         660                                                                           #            688   CAAG ACAAAGCC                                               - (2) INFORMATION FOR SEQ ID NO:3:                                             -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 234 base                                                           (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: cDNA                                                 -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:                                  - CTTTTTTTTT TTTTTTTGGC TATTTTAATA TTTTTTATTA AGGGCTATAA AA - #ATACCCAG          60                                                                           - AAGATAAATA AATGTGATGC AATGATATCT GTCCTAATAT GAAGAACTTT CT - #TTCACTGC         120                                                                           - ATTCTTTTCC TTCACAATGG CCTTCAAATC ACAGGAGGCA GTCATTCCAT GC - #CATTTCCT         180                                                                           - CTTCTTTTAT TACACGCTAC AGGATTTCCT TTAGAGCACA ATGGCTCGAG AT - #CG               234                                                                           - (2) INFORMATION FOR SEQ ID NO:4:                                             -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 229 base                                                           (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: cDNA                                                 -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:                                  - AAGGCGGCCG ACAGGACAAG GAGCATCCAA GATACCTGAT CCCAGAACTT GC - #AAACAGTT          60                                                                           - TTACCATTTA GGCTGGGTCA CTGGGACTGG AGGAGGAATT AGCTTGAAGC AT - #GGGTGAAA         120                                                                           - TCTACATTGC TCCTTCAGGA TGCAAAAGGA ACGAATTCAG CCTGAAGACA TG - #TTTGTTTG         180                                                                           #              229GGACA TAAGTGGACC TTCGCCATCG AAGAAGCTA                        - (2) INFORMATION FOR SEQ ID NO:5:                                             -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 538 base                                                           (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: cDNA                                                 -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:                                  - GCTACACTAG AGCAGAGTAC GAGTCTGAGG CGGAGGGAGT AATGGCAGGA CA - #AGCGTTTA          60                                                                           - GAAAGTTTCT TCCACTCTTT GACCGAGTAT TGGTTGAAAG GAGTGCTGCT GA - #AACTGTAA         120                                                                           - CCAAAGGAGG CATTATGCTT CCAGAAAAAT CTCAAGGAAA AGTATTGCAA GC - #AACAGTAG         180                                                                           - TCGCTGTTGG ATCGGGTTCT AAAGGAAAGG GTGGAGAGAT TCAACCAGTT AG - #CGTGAAAG         240                                                                           - TTGGAGATAA AGTTCTTCTC CCAGAATATG GAGGCACCAA AGTAGTTCTA GA - #TGACAAGG         300                                                                           - ATTATTTCCT ATTTAGAGAT GGTGACATTC TTGGAAAGTA CGTAGACTGA AA - #TAAGTCAC         360                                                                           - TATTGAAATG GCATCAACAT GATGCTGCCC ATTCCACTGA AGTTCTGAAA TC - #TTTCGTCA         420                                                                           - TGTAAATAAT TTCCATATTT CTCTTTTATA ATAAACTAAT GATAACTAAT GA - #CATCCAGT         480                                                                           - GTCTCCAAAA TTGTTTCCTT GTACTGATAT AAACACTTCC AAATAAAAAT AT - #GTAAAT           538                                                                           - (2) INFORMATION FOR SEQ ID NO:6:                                             -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 288 base                                                           (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: cDNA                                                 -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:                                  - ATCAAAATAG CCTGCTTGAC GATGTTGTTT AAATCAGACA CCCACCCCTA TT - #GTTATTCT          60                                                                           - GAATGAGAGA CACCTACCTG TATTATCTTC AGAGCAATGT TCACAGATCG GT - #CATGATTC         120                                                                           - AAGTTTTTGT TTGTTTCATT GTCTCCTAAC TGGTAAAATC CATACACGCC CA - #TTTCAGTT         180                                                                           - CATATTTATC TTCATACCTG TTGGGCCTAA CATCGTTCCT GATATATAGT GG - #GTGTTCTA         240                                                                           #               288GAAT AAGTTGGTTA ATGAGTAAAA TATAGGTT                         - (2) INFORMATION FOR SEQ ID NO:7:                                             -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 255 base                                                           (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: cDNA                                                 -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:                                  - ACCCAAGTCA ATAGAAGCCG GCGTAAAGAG TGTTTTAGAT CACCCCCTCC CC - #AATAAAGC          60                                                                           - TAAAACTCAC CTGAGTTGTA AAAAACTCCA GTTGACACAA AATAGACTAC GA - #AAGTGGCT         120                                                                           - TTAACATATC TGAACACACA ATAGCTAAGA CCCAAACTGG ATTAGATACC CC - #ACTATGCT         180                                                                           - TAAGCCCTAA ACCTCAACAG TTAAATCAAC AAAACTGCTC GCCAGAACGC TG - #GGGCCACA         240                                                                           #   255                                                                        - (2) INFORMATION FOR SEQ ID NO:8:                                             -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 317 amino                                                          (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: protein                                              -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:                                  - Met Arg Leu Gly Pro Arg Thr Ala Ala Leu Gl - #y Leu Leu Leu Leu Cys          #                 15                                                           - Ala Ala Ala Ala Gly Ala Gly Lys Ala Glu Gl - #u Leu His Tyr Pro Leu          #             30                                                               - Gly Glu Arg Arg Ser Asp Tyr Asp Arg Glu Al - #a Leu Leu Gly Val Gln          #         45                                                                   - Glu Asp Val Asp Glu Tyr Val Lys Leu Gly Hi - #s Glu Glu Gln Gln Lys          #     60                                                                       - Arg Leu Gln Ala Ile Ile Lys Lys Ile Asp Le - #u Asp Ser Asp Gly Phe          # 80                                                                           - Leu Thr Glu Ser Glu Leu Ser Ser Trp Ile Gl - #n Met Ser Phe Lys His          #                 95                                                           - Tyr Ala Met Gln Glu Ala Lys Gln Gln Phe Va - #l Glu Tyr Asp Lys Asn          #           110                                                                - Ser Asp Asp Thr Val Thr Trp Asp Glu Tyr As - #n Ile Gln Met Tyr Asp          #       125                                                                    - Arg Val Ile Asp Phe Asp Glu Asn Thr Ala Le - #u Asp Asp Ala Glu Glu          #   140                                                                        - Glu Ser Phe Arg Lys Leu His Leu Lys Asp Ly - #s Lys Arg Phe Glu Lys          145                 1 - #50                 1 - #55                 1 -        #60                                                                            - Ala Asn Gln Asp Ser Gly Pro Gly Leu Ser Le - #u Glu Glu Phe Ile Ala          #               175                                                            - Phe Glu His Pro Glu Glu Val Asp Tyr Met Th - #r Glu Phe Val Ile Gln          #           190                                                                - Glu Ala Leu Glu Glu His Asp Lys Asn Gly As - #p Gly Phe Val Ser Leu          #       205                                                                    - Glu Glu Phe Leu Gly Asp Tyr Arg Trp Asp Pr - #o Thr Ala Asn Glu Asp          #   220                                                                        - Pro Glu Trp Ile Leu Val Glu Lys Asp Arg Ph - #e Val Asn Asp Tyr Asp          225                 2 - #30                 2 - #35                 2 -        #40                                                                            - Lys Asp Asn Asp Gly Arg Leu Asp Pro Gln Gl - #u Leu Leu Pro Trp Val          #               255                                                            - Val Pro Asn Asn Gln Gly Ile Ala Gln Glu Gl - #u Ala Leu His Leu Ile          #           270                                                                - Asp Glu Met Asp Leu Asn Gly Asp Lys Lys Le - #u Ser Glu Glu Glu Ile          #       285                                                                    - Leu Glu Asn Pro Asp Leu Phe Leu Thr Ser Gl - #u Ala Thr Asp Tyr Gly          #   300                                                                        - Arg Gln Leu His Asp Asp Tyr Phe Tyr His As - #p Glu Leu                      305                 3 - #10                 3 - #15                            - (2) INFORMATION FOR SEQ ID NO:9:                                             -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 216 amino                                                          (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: peptide                                              -      (v) FRAGMENT TYPE: N-terminal                                           -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:                                  - Ile Ser Asn Glu Val Pro Glu Pro His Val Ty - #r Pro Leu Ser Leu Ile          #                15                                                            - Met Phe Met Ser Gly Gly Ser Ser Thr Ser Th - #r Leu Arg Arg Met Val          #            30                                                                - Pro Thr Pro Ser Thr Thr Ser Leu Ser Pro Ar - #g Ser Ser Ser Ser Thr          #        45                                                                    - Ser Gly Cys Leu His Phe Leu Leu Asp His Gl - #n Xaa Trp Pro His His          #    60                                                                        - Leu Pro Tyr Val Gln Ile Leu Leu Met Met Le - #u Met Leu Trp Glu Val          #80                                                                            - Cys Xaa Phe Met Val Gln Ser Gly Tyr Ile Ar - #g Leu Tyr Met Val Val          #                95                                                            - Gly Gly Ala Ala His Arg Thr Ser Lys Leu Le - #u Thr Gln Ser Gly Pro          #           110                                                                - Ser Leu Pro Gln Ala Thr Ser Ile Pro Ala Il - #e Leu Lys Ala Leu Gln          #       125                                                                    - Asp Glu Trp Asp Ala Val Met Leu His Ser Ph - #e Thr Leu Arg Gln Gln          #   140                                                                        - Leu Gln Thr Thr Arg Gln Glu Leu Ser His Al - #a Leu Tyr Gln His Asp          145                 1 - #50                 1 - #55                 1 -        #60                                                                            - Ala Ala Cys Arg Val Ile Ala Arg Leu Thr Ly - #s Glu Val Thr Ala Ala          #               175                                                            - Arg Glu Ala Leu Ala Thr Leu Lys Pro Gln Al - #a Gly Leu Ile Val Pro          #           190                                                                - Gln Leu Cys Gln Val Pro Asn Gln Val Leu Tr - #p Val Arg Val Ser Gln          #       205                                                                    - Trp Ile Trp Val Ser Trp Trp Glu                                              #   215                                                                        - (2) INFORMATION FOR SEQ ID NO:10:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 78 amino                                                           (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: peptide                                              -      (v) FRAGMENT TYPE: N-terminal                                           -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:                                 - Leu Phe Phe Phe Phe Leu Ala Ile Leu Ile Ph - #e Phe Ile Lys Gly Tyr          #                15                                                            - Lys Asn Thr Gln Lys Ile Asn Lys Cys Asp Al - #a Met Ile Ser Val Leu          #            30                                                                - Ile Xaa Arg Thr Phe Phe His Cys Ile Leu Ph - #e Leu His Asn Gly Leu          #        45                                                                    - Gln Ile Thr Gly Gly Ser His Ser Met Pro Ph - #e Pro Leu Leu Leu Leu          #    60                                                                        - His Ala Thr Gly Phe Pro Leu Glu His Asn Gl - #y Ser Arg Ser                  #75                                                                            - (2) INFORMATION FOR SEQ ID NO:11:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 75 amino                                                           (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: peptide                                              -      (v) FRAGMENT TYPE: N-terminal                                           -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:                                 - Gly Gly Arg Gln Asp Lys Glu His Pro Arg Ty - #r Leu Ile Pro Glu Leu          #                15                                                            - Ala Asn Ser Phe Thr Ile Xaa Ala Gly Ser Le - #u Gly Leu Glu Glu Glu          #            30                                                                - Leu Ala Xaa Ser Met Gly Glu Ile Tyr Ile Al - #a Pro Ser Gly Cys Lys          #        45                                                                    - Arg Asn Glu Phe Ser Leu Lys Thr Cys Leu Ph - #e Val Ile Xaa Met Lys          #    60                                                                        - Arg Thr Xaa Val Asp Leu Arg His Arg Arg Se - #r                              #75                                                                            - (2) INFORMATION FOR SEQ ID NO:12:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 102 amino                                                          (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: peptide                                              -      (v) FRAGMENT TYPE: N-terminal                                           -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:12:                                 - Met Ala Gly Gln Ala Phe Arg Lys Phe Leu Pr - #o Leu Phe Asp Arg Val          #                15                                                            - Leu Val Glu Arg Ser Ala Ala Glu Thr Val Th - #r Lys Gly Gly Ile Met          #            30                                                                - Leu Pro Glu Lys Ser Gln Gly Lys Val Leu Gl - #n Ala Thr Val Val Ala          #        45                                                                    - Val Gly Ser Gly Ser Lys Gly Lys Gly Gly Gl - #u Ile Gln Pro Val Ser          #    60                                                                        - Val Lys Val Gly Asp Lys Val Leu Leu Pro Gl - #u Tyr Gly Gly Thr Lys          #80                                                                            - Val Val Leu Asp Asp Lys Asp Tyr Phe Leu Ph - #e Arg Asp Gly Asp Ile          #                95                                                            - Leu Gly Lys Tyr Val Asp                                                                  100                                                                - (2) INFORMATION FOR SEQ ID NO:13:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 95 amino                                                           (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: peptide                                              -      (v) FRAGMENT TYPE: N-terminal                                           -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:13:                                 - Gln Asn Ser Leu Leu Asp Asp Val Val Xaa Il - #e Arg His Pro Pro Leu          #                15                                                            - Leu Leu Phe Xaa Met Arg Asp Thr Tyr Leu Ty - #r Tyr Leu Gln Ser Asn          #            30                                                                - Val His Arg Ser Val Met Ile Gln Val Phe Va - #l Cys Phe Ile Val Ser          #        45                                                                    - Xaa Leu Val Lys Ser Ile His Ala His Phe Se - #r Ser Tyr Leu Ser Ser          #    60                                                                        - Tyr Leu Leu Gly Leu Thr Ser Phe Leu Ile Ty - #r Ser Gly Cys Ser Ile          #80                                                                            - Asn Ile Tyr Xaa Met Asn Lys Leu Val Asn Gl - #u Xaa Asn Ile Gly              #                95                                                            - (2) INFORMATION FOR SEQ ID NO:14:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #acids    (A) LENGTH: 84 amino                                                           (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: peptide                                              -      (v) FRAGMENT TYPE: N-terminal                                           -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:14:                                 - Pro Ser Gln Xaa Lys Pro Ala Xaa Arg Val Ph - #e Xaa Ile Thr Pro Ser          #                15                                                            - Pro Ile Lys Leu Lys Leu Thr Xaa Val Val Ly - #s Asn Ser Ser Xaa His          #            30                                                                - Lys Ile Asp Tyr Glu Ser Gly Phe Asn Ile Se - #r Glu His Thr Ile Ala          #        45                                                                    - Lys Thr Gln Thr Gly Leu Asp Thr Pro Leu Cy - #s Leu Ser Pro Lys Pro          #    60                                                                        - Gln Gln Leu Asn Gln Gln Asn Cys Ser Pro Gl - #u Arg Trp Gly His Ser          #80                                                                            - Leu Lys Leu Lys                                                              - (2) INFORMATION FOR SEQ ID NO:15:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 20 base                                                            (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: other nucleic acid                                   -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:15:                                 # 20               TAGC                                                        - (2) INFORMATION FOR SEQ ID NO:16:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 20 base                                                            (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: other nucleic acid                                   -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:16:                                 # 20               TGAC                                                        - (2) INFORMATION FOR SEQ ID NO:17:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 20 base                                                            (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: other nucleic acid                                   -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:17:                                 # 20               GCCC                                                        - (2) INFORMATION FOR SEQ ID NO:18:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 36 base                                                            (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: other nucleic acid                                   -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:18:                                 #       36         GCAT GTCCCCTATA CTAGGT                                      - (2) INFORMATION FOR SEQ ID NO:19:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 28 base                                                            (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: other nucleic acid                                   -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:19:                                 #             28   TGAC AAATTCCG                                               - (2) INFORMATION FOR SEQ ID NO:20:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 30 base                                                            (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: other nucleic acid                                   -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:20:                                 #           30     ATTT GTCATTCAAG                                             - (2) INFORMATION FOR SEQ ID NO:21:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 32 base                                                            (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: other nucleic acid                                   -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:21:                                 #          32      GCAG TTGGGTCCCA CC                                          - (2) INFORMATION FOR SEQ ID NO:22:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 31 base                                                            (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: other nucleic acid                                   -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:22:                                 #          31      TGTT GAGAAAGACA G                                           - (2) INFORMATION FOR SEQ ID NO:23:                                            -      (i) SEQUENCE CHARACTERISTICS:                                           #pairs    (A) LENGTH: 30 base                                                            (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                 -     (ii) MOLECULE TYPE: other nucleic acid                                   -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:23:                                 #           30     CCCT GATTATTAGG                                             __________________________________________________________________________

Equivalents

Those skilled in the art will recognize, or be able to ascertain using no more than routine experimentation, many equivalents to the specific embodiments of the invention described herein. Such equivalents are intended to be encompassed by the following claims. 

We claim:
 1. A substantially pure preparation of the E6-BP^(SD-7) polypeptide of SEQ ID NO:
 8. 2. A chimeric polypeptide comprising:an E6-BP^(SD-7) polypeptide sequence comprising residues 194 to 218 of SEQ ID No. 8, and a second amino acid sequence unrelated to an E6-BP^(SD-7) protein.
 3. The chimeric polypeptide of claim 2 comprising amino acid residues Met99 to Leu317 of SEQ ID No.
 8. 4. The chimeric polypeptide of claim 2, comprising a 25 to 100 amino acid residue fragment of an E6-BP^(SD-7) protein of SEQ ID No.
 8. 5. The chimeric polypeptide of claim 2, wherein said peptide is recombinantly produced.
 6. The chimeric polypeptide of claim 2 wherein said polypeptide comprises a 25 to 50 amino acid residue fragment of SEQ ID No.
 8. 7. An isolated or recombinant E6-BP^(SD-7) polypeptide encoded by a nucleic acid which hybridizes under highly stringent conditions of about 0.2× SSC at 50° C. specifically to the nucleic acid designated in SEQ ID No. 1, which binds E6 protein.
 8. A substantially pure preparation of a fragment of the E6-BP^(SD-7) polypeptide of SEQ ID NO: 8, said fragment comprising amino acid residues Ala194 to Asp218 of SEQ. ID No.
 8. 9. The preparation of claim 8, wherein said fragment comprises Met99 to Leu317 of SEQ ID No.
 8. 10. The preparation of claim 8, wherein said fragment is 25-200 amino acid residues in length. 